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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 
CONTINUING APPLICATION TRANSMITTAL UNDER 37 CFR § 1.53(b) 

Box Patent Application £ 
Assistant Commissioner for Patents *g 
Washington, D.C. 20231 ^ 



Sir: u 
This is a request under 37 CFR §1 .53 for filing a 

is continuation application. 
□ divisional application. 

1 . Particulars of Prior Application 

Application Serial No: 09/136,389 

U.S. Filing Date: August 18, 1998 

Title: PROTEINS ENCODING GELONIN SEQUENCES 

Art Unit: 1 645 

Examiner: A. Salimi 

Prior Docket No.: 11 022US1 0 / 200-70.P4.C1 



PFRTIFICATION UN DFR 37 CFR §1.10 

, hereby certify that this Continuing Application Transmittal Under 37 CFR SI 53(bl I and the 
documents referred to as enclosed therewith are be.ng depos.ted w.th the United I States Postal 
™n 6, 2000 in an envelope addressed to BOX PATENT APPLICAT ON. Assistant 
Comm^ioner y for'patents, Washington, D.C. 20231 utilizing the "Express , Ma,. Post : Ofh« > to 
Addressee" service of the United States Postal Serv,ce under Maihng Label No. EL567394949US. 

Janet M. McNicholas, Ph.D. 
Registration No. 32,918 



2. This request is filed by: 



1. 

Full Name of 
Inventor 


Family Name 
Better 


First Given Name 
Marc 


Second Given Name 
D. 


Residence & 
Citizenship 


City 

Los Angeles 


State or Foreign 
Country 

California 


Country of Citizenship 
United States 


Post Office 
Address 


Post Office Address 
2462 Zorada Drive 


City 

Los Angeles 


State & Zip Code/Country 
California 90046 


2. 

Full Name of 
Inventor 


Family Name 
Carroll 


First Given Name 
Stephen 


Second Given Name 
F. 


Residence & 
Citizenship 


City 

Walnut Creek 


State or Foreign 
Country 

California 


Country of Citizenship 
United States 


Post Office 
Address 


Post Office Address 

1 308 Milton Avenue 


City 

Walnut Creek 


State & Zip Code/Country 
California 94596 


3. 

Full Name of 
Inventor 


Family Name 


First Given Name 


Second Given Name 


Residence & 
Citizenship 


City 


State or Foreign 
Country 


Country of Citizenship 


Post Office 
Address 


Post Office Address 


City 


State & Zip Code/Country 



□ This application is being filed by less than all the inventors named in the 
prior application. An accompanying statement requests deletion of the 
name(s) of the person(s) who are not inventors of the invention being 
claimed in this application. 



3. 



Amendments 



la Amend the specification by inserting before the first line the 
sentence: 

-- This is a Continuation of U.S. Application No. 09/136,389 filed 
August 18, 1998. 
o Cancel claims in the prior application before calculating the filing 
fee. 

□ A Preliminary Amendment is enclosed. 

□ The filing fee is based upon entry of the foregoing amendments) (if 
any). 

4. Copy of Prior Application 

The enclosed is a copy of the prior complete application, including the 
specification (with claims), drawings, the oath or declaration, and any 
amendments referred to in the oath or declaration filed to complete the 
prior application. 

5. Incorporation By Reference 

The entire disclosure of the prior application, from which a copy of the oath 
or declaration is supplied under paragraph 4, above, is considered as being 
part of the disclosure of the accompanying application and is hereby 
incorporated by reference therein. 

6. Priority 

□ Priority of application No. , filed on 

in is claimed under 35 

USC §119. 

□ The certified copy(ies) was(were) filed in prior U.S. 
Application No. . 

□ The certified copy(ies) has(have) not been filed. 
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7. Assignment 



h The prior application is assigned of record to XOMA Corporation and 
has been recorded at Reel No. 9153, Frame No. 0378 (7 pages). 

8. Small Entity Status 

□ Verified statement(s) claiming small entity status is(are) attached. 

la Small entity status has been established in the prior application and 
is still proper and desired. 



9. Fee Calculation 



CLAIMS AS FILED - INCLUDING PRELIMINARY AMENDMENT (IF AISM 


f) 






SMALL ENTITY 


OTHER THAN A 
ENTITY 


i SMALL 




NO. FILED 


NO. 
EXTRA 


RATE 


FEE 


RATE 


FEE 


BASIC FEE 








$345.00 




$690.00 


TOTAL 






X 9 = 


$ 


X 18 = 


$ 


INDEP. 






X 39 = 


$ 


X 78 = 


$ 


□ First Presental 


ion of Multiple Dependent Claim 


+ 130 = 


$ 


+ 260 - 


$ 


Filing Fee: 


$ 


OR 


$ 



10. Method of Payment of Fees 

□ Attached is a check in the amount of: 

□ Charge Deposit Account No. 13-0017 in the amount of:$ 
A copy of this Transmittal is enclosed. 

ei This continuation application is being filed without a fee. 



1 1 . Deposit Account and Refund Authorization 

The Commissioner is hereby authorized to charge any deficiency in the 
amount enclosed or any additional fees, other than the application filing fee, 
which may be required during the pendency of this application under 37 CFR 
§1.16 or 37 CFR §1.17 to Deposit Account No. 13-0017. A copy of this 
Transmittal is enclosed. Please refund any overpayment to McAndrews, 
Held & Malloy, Ltd. at the address below. 

Please direct all future communications to: 

Janet M. McNicholas, Ph.D. 
McAndrews, Held & Malloy, Ltd. 
500 W. Madison Street, 34 th Floor 
Chicago, Illinois 60661 

Respectfully submitted, 

McANDREWS, HELD & MALLOY, Ltd. 

500 W. Madison Street, 34 th Floor 

Chicago, Illinois 60661 

(312) 775-8000 

(312) 775-8100 (Facsimile) 

* 

Date: July 6, 2000 By: VjoLXMa^ ful . AA<^icJ&^^X 

Janet M. McNicholas, Ph.D. 
Reg. No. 32,918 



JOINT INVENTORS 

"EXPRESS MAIL" Mail Label No. EL567394949US. 
Date of Deposit: July 6, 2000 

I hereby certify that this paper (or fee) is being 
deposited with the United States Postal Service 
"EXPRESS MAIL POST OFFICE TO ADDRESSEE" 
service under 37 CFR §1.10 on the date indicated 
above and is addressed to: BOX PATENT 
APPLICATION, Assistant Commissioner for Patents, 
Washington, D.C. 20231 

Janet M. McNicholas, Ph.D. 
Registration No. 32,918 



APPLICATION FOR 
UNITED STATES LETTERS PATENT 

SPECIFICATION 

Attorney's Docket No. 11022US11 / 200-70.P4.C2 

TO ALL WHOM IT MAY CONCERN: 

Be it known that we, MARC D. BETTER, a citizen of the United States 
residing at 2462 Zorada Drive, Los Angeles, California 90046, and STEPHEN 
F. CARROLL, a citizen of the United States residing at 1308 Milton Avenue, 
Walnut Creek, California 94596, have invented new FUSION PROTEINS AND 
POLYNUCLEOTIDES ENCODING GELONIN SEQUENCES, of which the 
following is a specification. 



FUSION PROTEINS AND POLYNUCLEOTIDES ENCODING 

GELONIN SEQUENCES 

FIELD OF THE INVENTION 
This application is a continuation of co-pending 
U.S. Patent Application Serial No. 08/646,360 filed May 12, 
1994, which is a continuation-in-part of U.S. Patent 
Application Serial No. 08/064,691 filed May 12, 1993 (now 
abandoned), which is a continuation-in-part of U.S. Patent 
Application Serial No. 07/988,430 filed December 9, 1992 
(now U.S. Patent No. 5,416,202), which is in turn a 
continuation-in-part of U.S. Patent Application Serial 
No. 07/901,707 filed June 19, 1992 (now U.S. Patent 
No. 5,376,546), which in turn is a continuation-in-part of 
U.S. Patent Application Serial No. 07/787,567 filed 
November 4, 1991 (now abandoned) . 

The present invention generally relates to 
materials useful as components of cytotoxic therapeutic 
agents. More particularly, the invention relates to 
ribosome- inactivating proteins, to analogs of ribosome- 
inactivating proteins, to polynucleotides encoding such 
proteins and analogs, some of which are specifically 
modified for conjugation to targeting molecules, and to 
gene fusions of polynucleotides encoding ribosome- 
inactivating proteins to polynucleotides encoding targeting 
molecules . 

BACKGROUND 

Ribosome- inactivating proteins (RIPs) comprise a 
class of proteins which is ubiquitous in higher plants. 
However, such proteins have also been isolated from 
bacteria. RIPs are potent inhibitors of eukaryotic protein 
synthesis. The N-glycosidic bond of a specific adenine 
base is hydrolytically cleaved by RIPs in a highly 
conserved loop region of the 28S rRNA of eukaryotic 
ribosomes thereby inactivating translation. 

Plant RIPs have been divided into two types. 
Stirpe et al., FEBS Lett., 19S(1,2):1-B (1986). Type I 
proteins each consist of a single peptide chain having 
ribosome-inactivating activity, while Type II proteins each 
consist of an A- chain, essentially equivalent to a Type I 
protein, disulfide- linked to a B-chain having cell-binding 
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propert ies . Gelonin , dodecandr in , tr icosanthin , 

tricokirin, bryodin, Mirabilis antiviral protein (MAP) , 
barley ribosome- inactivating protein (BRIP) , pokeweed 
antiviral proteins (PAPs) , saporins, luff ins, and momordins 
5 are examples of Type I RIPs; whereas Ricin and abrin are 

examples of Type II RIPs. 

Amino acid sequence information is reported for 
various ribosome- inactivating proteins. It appears that at 
least the tertiary structure of RIP active sites is 

10 conserved among Type I RIPs, bacterial RIPs, and A-chains 

of Type II RIPs. In many cases, primary structure homology 
is also found. Ready et ai., J • Biol. Chem., 
259 (24) : 15252-15256 (1984) and other reports suggest that 
the two types of RIPs are evolutionarily related. 

15 Type I plant ribosome-inactivating proteins may 

be particularly suited for use as components of cytotoxic 
therapeutic agents. A RIP may be conjugated to a targeting 
agent which will deliver the RIP to a particular cell type 
in vivo in order to selectively kill those cells. 

20 Typically, the targeting agent (e.g., an antibody) is 
linked to the toxin by a disulfide bond which is reduced in 
vivo allowing the protein toxin to separate from the 
delivery antibody and become active intracellular ly . 
Another strategy for producing targeted cytotoxic proteins 

25 is to express a gene encoding a cytotoxic protein fused to 

a gene encoding a targeting moiety. The resulting protein 
product is composed of one or more polypeptides containing 
the cytotoxic protein linked to, for example, at least one 
chain of an antibody. 

30 A variety of such gene fusions are discussed in 

Pastan et ml.. Science, 254:1173-1177 (1991). However, 
these fusion proteins have been constructed with sequences 
from diphtheria toxin or Pseudomonas aeruginosa exotoxin A, 
both of which are ADP-ribosyltransferases of bacterial 
35 origin. These protein toxins are reported to intoxicate 
cells and inhibit protein synthesis by mechanisms which 
differ from those of the RIPs. Moreover, diphtheria toxin 
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and exotoxin A are structurally different from, and show 
little amino acid sequence similarity with, RIPs. in 
general, fusion proteins made with diphtheria toxin or 
exotoxin A have been immunogenic and toxic in animals, and 
5 are produced intracellular ly in relatively low yield. 

Another strategy for producing a cytotoxic agent is to 
express a gene encoding a RIP fused to a gene encoding a 
targeting moiety. The resulting protein product is a 
single polypeptide containing, a RIP linked to, for example, 
10 at least one chain of an antibody. 

Because some RIPs, such as the Type I RIP 
gelonin, are primarily available from scarce plant 
materials, it is desirable to clone the genes encoding the 
RIPs to enable recombinant production of the proteins. It 
15 is also desirable to develop analogs of the natural 

proteins which may be easily conjugated to targeting 
molecules while retaining their natural biological activity 
because most Type I RIPs have no natural sites (i.e. 
available cysteine residues) for conjugation to targeting 
20 agents. Alternatively, it is desirable to develop gene 

fusion products including Type I RIPs as a toxic moiety and 
antibody substances as a targeting moiety. 

The present invention also provides novel 
humanized or human-engineered antibodies and methods for 
25 producing such antibodies which may be conjugated or fused 
to various toxins. Such conjugations or fusions are useful 
in the treatment of various disease states, including 
autoimmune diseases and cancer. 

There are several reports relating to replacement 
30 of amino acids in a mouse antibody with amino acids 
normally occurring at the analogous position in the human 
form of the antibody. See, e.g., Junghaus, et al., Cancer 
Res., 50: 1495-1502 (1990) and other publications which 
describe genetically-engineered mouse/human chimeric 
35 antibodies. Also by genetic engineering techniques, the 
genetic information from murine hypervariable 
complementarity determining regions (hereinafter referred 
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to as "CDRs") may be inserted in place of the DNA encoding 
the CDRs of a human monoclonal antibody to generate a 
construct encoding a human antibody with murine CDRs. See, 
e.g., Jones, et al., Nature, 321: 522-525 (1986). 
5 Protein structure analysis on such M CDR-graf ted w 

antibodies may be used to "add back" murine residues in 
order to restore lost antigen-binding capability, as 
described in Queen, et al, Proc. Natl. Acad. Sci. (USA), 
35:10029-10033 (1989); Co, et al., Proc. Nat. Acad. Sci. 

10 (USA), 88: 2869-2873 (1991). However, a frequent result of 

CDR-grafting is that the specific binding acitvity of the 
resulting humanized antibodies may be diminished or 
completely abolished. 

As demonstrated by the foregoing, there exists a 

15 need in the art for cloned genes encoding Type I RIPs, for 

analogs of Type I RIPs which may be easily conjugated to 
targeting molecules, and for gene fusion products 
comprising Type I RIPs, and especially wherein such gene 
fusions also comprise an humanized antibody portion. 



20 flmQUftY OF TOB IKVnTPIOH 

The present invention provides purified and 
isolated polynucleotides encoding Type I RIPs, Type I RIPs 
having a cysteine available for disulfide bonding to 
targeting molecules and fusion products comprising Type I 

25 RIPs. Vectors comprising the polynucleotides and host 
cells transformed with the vectors are also provided. 

A purified and isolated polynucleotide encoding 
natural sequence gelonin (SEQ ID NO: 11) , and a host cell 
including a vector encoding gelonin of the type deposited 

30 as ATCC Accession No. 68721 are provided. Further provided 

are a purified and isolated polynucleotide encoding natural 
sequence barley ribosome-inactivating protein and a 
purified and isolated polynucleotide encoding momordin II. 

Some of the polynucleotides mentioned above 

35 encode fusion proteins of the present invention comprising 
gelonin (SEQ ID NO: 2) or another RIP and an antibody or a 
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fragment comprising an antigen-binding portion thereof* 
Several alternate forms of fusion proteins comprising 
gelonin are contemplated herein* For example, the fusion 
protein may contain a single RIP fused to a monovalent 
antibody binding moiety, such as a Fab or single chain 
antibody . Alternatively, multivalent forms of the fusion 
proteins may be made and may have greater activity than the 
monovalent forms. In preferred embodiments of the 
. invention, gelonin may be fused to either the car boxy or 
the amino terminus of the antibody or antigen-binding 
: portion of thereof. Also in a preferred embodiment of the 
invention, the antibody or fragment thereof comprising an 
antigen-binding portion may be an he3 antibody, an he3-Fab, 
an he3 Fd, single-chain antibody, or an he3 kappa fragment. 
15 The antibody or antigen-binding portion thereof may be 

fused to gelonin by means of a linker peptide, preferably* 
a peptide segment of shiga-like toxin as shown in SEQ ID 
NO: 56 or a peptide segment of Rabbit muscle aldolase as 
shown in SEQ ID NO: 57 or a human muscle aldolase, an 
20 example of which is reported in Izzo, et al., Eur. J. 

Biochem, 174: 569-578 (1988), incorporated by reference 
herein. 

Analogs of a Type I plant RIP are defined herein 
as non-naturally occurring polypeptides that share the 

25 ribosome-inactivating activity of the natural protein but 

that differ in amino acid sequence from the natural type I 
RIP protein in some degree but less than they differ from 
the amino acid sequences of other Type I plant RIP. 
Preferred analogs according to the present invention are 

30 analogs of Type I plant RIPs each having a cysteine 
available for disulfide bonding located at a position in 
its amino acid sequence from the position corresponding to 
position 251 in SEQ ID NO: 1 to the carboxyl terminal 
position of the analog. SEQ ID NO: 1 represents the amino 

35 acid sequence of ricin A-chain. Other preferred analogs 

according to the invention are Type I RIPs each having a 
cysteine available for disulfide bonding at a position in 
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the analog that is on the surface of the protein in its 
natural conformation and that does not impair native 
folding or biological activity of the ribosome- inactivating 
protein * Analogs of bacterial RIPs are also contemplated 
5 by the present invention. 

The present invention provides an analog of a 
Type I ribosome- inactivating protein, which analog has a 
cysteine available for intermolecular disulfide bonding at 
an amino acid position corresponding to a position not 

10 naturally available for intermolecular disulfide bonding in 

the Type I ribosome- inactivating protein and which cysteine 
is located at a position in the amino acid sequence of the 
analog corresponding to position 259 in SEQ ID NO: 1 or at 
a position in the amino acid sequence in the analog 

15 corresponding to a position from position 251 in SEQ ID NO: 

1 to the carboxyl terminal position of the analog. 

An analog according to the present invention may 
be an analog of gelonin. In an analog of gelonin according 
to the present invention, the cysteine may be at a position 

20 in the analog from position 244 to the carboxyl terminal 

position of the analog , more preferably at a position in 
the analog from position 247 to the carboxyl terminal 
position of the analog, and most preferably at position 
244, at position 247, or at position 248 of the amino acid 

25 sequence of the analog. In these analogs, it is preferred 
that the gelonin cysteine residues at positions 44 and 50 
be replaced with non-cysteine residues such as alanine. 

An analog according to the present invention may 
be an analog of barley ribosome- inactivating protein. 

30 Preferably, a cysteine in such an analog is at a position 
in the analog from position 256 to the carboxyl terminal 
position, and more preferably the cysteine is at a position 
in the amino acid sequence of the analog from position 260 
to the carboxyl terminal position of the analog. Most 

35 preferably, in these regions, the cysteine is at position 
256, at position 270, or at position 277 of the amino acid 
sequence of the analog. 



WO 94/26910 PCT/US94/05348 



-7- 



An analog according to the present invention may 
be an analog of momordin II. 

Analogs according to the present invention may 
have a cysteine in the amino acid sequence of the analog at 
5 a position which corresponds to a position within one amino 

acid of position 259 of SEQ ID NO: 1. Such an analog may 
be an analog of gelonin, of barley ribosome- inactivating 
protein, or of momordin II. 

The present invention also provides a 
10 polynucleotide encoding an analog of a Type I ribosome- 

inactivating protein, which analog has a cysteine available 
for intermolecular disulfide bonding at an amino acid 
position corresponding to a position not naturally 
available for intermolecular disulfide bonding in the Type 
15 I ribosome- inactivating protein, and which cysteine is 

located at a position in the amino acid sequence of the- 
analog from the position corresponding to position 251 in 
SEQ ID NO: 1 to the carboxyl terminal position of the 
analog. The polynucleotide may encode an analog of 
gelonin, preferably an analog wherein the cysteine is at a 
position in the amino acid sequence of the analog from 
position 244 to the carboxyl terminal position of the 
analog , more preferably wherein the cysteine is at a 
position in the analog from position 247 to the carboxyl 
25 terminal position of the analog, and most preferably the 

cysteine is at position 244, at position 247 or at position 
248 of the amino acid sequence of the analog. It is 
preferred tfcat a polynucleotide according to the present 
invention encode a gelonin analog wherein the native 
30 gelonin cysteine residues at positions 44 and 50 are 
replaced with non-cysteine residues, such as alanine. 

A polynucleotide according to the present 
invention may encode an analog of barley ribosome- 
inactivating protein, preferably an analog wherein the 
35 cysteine is at a position in the analog from position 256 

to the carboxyl terminal position of the analog, more 
preferably wherein the cysteine is at a position in the 
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analog from position 2 60 to the carboxyl terminal position 
of the analog, and most preferably wherein the cysteine is 
at position 256, at position 270 or at position 277 of the 
amino acid sequence of the analog. 
5 A polynucleotide according to the present 

invention may encode an analog of mormordin II. 

The present invention provides a vector including 
a polynucleotide encoding an analog of a Type I ribosome- 
inactivating protein, which analog has a cysteine available 
10 for intermolecular disulfide bonding at a amino acid 

position corresponding to a position not naturally 
available for intermolecular disulfide bonding in the Type 
I ribosome- inactivating protein and which cysteine is 
located at a position in the amino acid sequence of the 
analog from the position corresponding to position 251 in 
SEQ ID NO: 1 to the carboxyl terminal position of the - 
analog. 

The present invention further provides a host 
cell including a DNA vector encoding an analog of a Type I 
ribosome- inactivating protein, which analog has a cysteine 
available for intermolecular disulfide bonding at an amino 
acid position corresponding to a position not naturally 
available for intermolecular disulfide bonding in the Type 
I ribosome-inactivating protein and which cysteine is 
located at a position in the amino acid sequence of the 
analog from the position corresponding to position 251 in 
SEQ ID NO: 1 to the carboxyl terminal position of the 
analog. In such a host cell the vector may encode an 
analog of gelonin, especially an analog wherein the 
cysteine is at position 247 of the amino acid sequence of 
the analog, such as in the host cell deposited as ATCC 
Accession No. 69009. 

A host cell according to the present invention 
may include a vector encoding barley ribosome-inactivating 
35 protein, especially preferred is a host cell containing a 
BRIP analog wherein the cysteine is at position 277, such 
as in the host cell deposited on October 2, 1991 with the 
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American Type Culture Collection, 123 01 Parklawn Drive, 
Rockville, MD 20852 as ATCC Accession No* 68722. 
Particularly preferred are prokaryotic host cells because 
such cells may be less sensitive to the action or RIPs as 
5 compared to eukaryotic cells. 

The present invention also provides an agent 
toxic to a cell including an analog of a Type I ribosome- 
inactivating protein linked by a disulfide bond through a 
cysteine to a molecule which specifically binds to the 

10 cell, which cysteine is at an amino acid position in the 

analog corresponding to a position not naturally available 
for intermolecular disulfide bonding in the Type I 
ribosome- inactivating protein and which cysteine is located 
in the amino acid sequence of the analog from the position 

15 corresponding to position 251 in SEQ ID NO: l to the 
carboxyl terminal position of the analog. The agent may- 
include an analog of gelonin, preferably an analog wherein 
the cysteine is at a position in the analog from position 

247 to the carboxyl terminal position of the analog, and 
20 more preferably wherein the cysteine is at position 247 or 

248 of the amino acid sequence of analog. An agent 
including an analog wherein the native gelonin cysteine 
residues at positions 44 and 50 are replaced with non- 
cysteine residues, such as alanine is preferred. 

25 An agent according to the present invention may 

include an analog of barley ribosome- inactivating protein, 
preferably an analog wherein the cysteine is at a position 
in the analog from position 260 to the carboxyl terminal 
position of the analog, more preferably wherein the 

30 cysteine is at a position in the analog from position 270 
to the carboxyl terminal position of the analog, and most 
preferably wherein the cysteine is at position 256, at 
position 270 or at position 277 of the amino acid sequence 
of the analog. 

35 An agent according to the present invention may 

include an analog of momordin IX. 
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The present invention provides an agent wherein 
the Type I ribosome-inactivating protein is linked to an 
antibody, particularly to an H65 antibody or to an antibody 
fragment, more particularly to an antibody fragment 
5 selected from the group consisting of chimeric and human 

engineered antibody fragments, and most particularly to a 
Fab antibody fragment, a Fab' antibody fragment or a F(ab») 2 
antibody fragment. It is highly preferred that an agent 
according to the present invention include a chimeric or 

10 human engineered antibody fragment selected from the group 

consisting of a Fab antibody fragment, a Fab 1 antibody 
fragment and a F(ab f )2 antibody fragment. 

A method according to the present invention for 
preparing an analog of a Type I ribosome-inactivating 

15 protein includes the step of expressing in a suitable host 

cell a polynucleotide encoding a Type I ribosome- - 
inactivating fusion protein or type I RIP (especially 
gelonin) having a cysteine available for intermolecular 
disulfide bonding substituted (e.g., by site-directed 

20 mutagenesis of the natural DNA sequence encoding the RIP or 
by chemical synthesis of a ONA sequence encoding the RIP 
analog) at an amino acid position corresponding to a 
position not naturally available for intermolecular 
disulfide bonding in the Type I ribosome-inactivating 

25 protein, which cysteine is located at a position in the 
amino acid sequence of the analog from the position 
corresponding to position 251 in SEQ ID NO: 1 to the 
carboxyl terminal position of the analog . 

A product according to the present invention may 

30 be a product of a method including the step of expressing 
in a suitable host cell a polynucleotide encoding a Type I 
ribosome-inactivating protein having a cysteine available 
for intermolecular disulfide bonding substituted at an 
amino acid position corresponding to a position not 

35 naturally available for intermolecular disulfide bonding in 
the Type I ribosome-inactivating protein, which cysteine is 
located at a position in the amino acid sequence of the 
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analog from the position corresponding to position 251 in 
SEQ ID NO: 1 to the carboxyl terminal position of the 
analog* 

The present invention provides a method for 
5 preparing an agent toxic to a cell including the step of 

linking an analog of a Type I ribosome-inactivating protein 
through a cysteine to a molecule which specifically binds 
to the cell, which analog has the cysteine at an amino acid 
position corresponding to a position not naturally 

10 available for intermolecular disulfide bonding in the Type 

I ribosome- inactivating protein and which cysteine is 
located at a position in the amino acid sequence of the 
analog from the position corresponding to position 251 in 
SEQ ID NO: 1 to the carboxyl terminal position of the 

15 analog. 

According to the present invention, a method f or - 
treating a disease in which elimination of particular cells 
is a goal may include the step of administering to a 
patient having the disease a therapeutically effective 

20 amount of an agent toxic to the cells including a type I 

RIP (especially gelonin fused to or an analog of a Type I 
ribosome- inactivating protein linked through a cysteine to 
a molecule which specifically binds to the cell, the analog 
having the cysteine at an amino acid position corresponding 

25 to a position not naturally available for intermolecular 
disulfide bonding in the Type I ribosome-inactivating 
protein and the cysteine being located at a position in the 
amino acid sequence of the analog from the position 
corresponding to position 251 in SEQ ID NO: 1 to the 

30 carboxyl terminal position of the analog. 

Useful target cells for immunotoxins of the 
present invention include, but are not limited to, cells 
which are pathogenic, such as cancer cells, autoimmune 
cells, and virally- infected cells. Such pathogenic cells 

35 may be targeted by antibodies or other targeting agents of 
the invention which are joined, either by genetic 
engineering techniques or by chemical cros s-1 inking , to an 
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RIP. specifically useful targets include tumor-associated 
antigens (e.g., on cancer cells), cell differentiation 
markers (e.g., on autoimmune cells), parasite-specific 
antigens, bacteria-specific antigens, and virus-specific 
5 antigens . 

The present invention also provides an analog of 
a Type I ribosome- inactivating protein, wherein the analog 
has a cysteine available for intermolecular disulfide 
bonding located at an amino acid position corresponding to 

10 a position not naturally available for intermolecular 
disulfide bonding in the Type I ribosome- inactivating 
protein and corresponding to a position on the surface of 
ricin A-chain in its natural conformation, and wherein the 
analog retains the ribosome- inactivating activity of the 

15 Type I ribosome-inactivating protein. 

Such a - fusion protein or an analog may be a . 
fusion protein or an analog wherein the Type I ribosome 
inactivating protein is gelonin, and the analog is 
preferably an analog of gelonin wherein the cysteine is at 

20 position 10 of the amino acid sequence of the analog as 
encoded in a vector in a host cell deposited with the 
American Type Culture Collection, 12301 Parklawn Drive, 
Rockville, MD 20852 as ATCC Accession No. 69008 on June 9, 
1992. Other such gelonin analogs include those wherein the 

25 cysteine is at a position 60, 103, 146, 184 or 215 in the 

amino acid sequence of the gelonin analog. It is preferred 
that the gelonin cysteine residues at positions 44 and 50 
be replaced with non-cysteine residues such as alanine in 
these analogs. 

30 The present invention further provides an analog 

of a Type I ribosome-inactivating protein wherein the 
analog includes only a single cysteine. Such an analog may 
be an analog of gelonin and is preferably an analog wherein 
the single cysteine is at position 10, position 44, 

35 position 50 or position 247 in the amino acid sequence of 

the analog, but the cysteine may be located at other 
positions defined by the invention as well. 



WO 94/26910 



PCT/US94/0534S 



-13- 

The present invention provides a polynucleotide 
encoding an analog of a Type I ribosome- inactivating 
protein, wherein the analog has a cysteine available for 
intermolecular disulfide bonding located at an amino acid 
5 position corresponding to a position not naturally 

available for intermolecular disulfide bonding in the Type 
I ribosome-inactivating protein and corresponding to a 
position on the surface of ricin A-chain in its natural 
conformation, and wherein the analog retains ribosome- 
10 inactivating activity of the Type I ribosome-inactivating 

protein. 

According to the present invention, a method for 
preparing an analog of a Type I ribosome-inactivating 
protein may include the step of expressing in suitable host 

15 cell a polynucleotide encoding a Type I ribosome- 

inactivating protein having a cysteine available for . 
intermolecular disulfide bonding substituted at an amino 
acid position corresponding to a position not naturally 
available for disulfide bonding in the Type I ribosome- 

20 inactivating protein, the cysteine is located at a position 

corresponding to an amino acid position on the surface of 
ricin A-chain in its natural conformation and which analog 
retains ribosome-inactivating activity of the Type I 
ribosome-inactivating protein, 

25 The present invention provides an agent toxic to 

a cell including an analog of a Type I ribosome- 
inactivating protein linked by a disulfide bond through a 
cysteine to a molecule which specifically binds to the 
cell, wherein the analog has a cysteine available for 

30 intermolecular disulfide bonding located at an amino acid 
position corresponding to a position not naturally 
available for intermolecular disulfide bonding in the Type 
I ribosome-inactivating protein and corresponding to a 
position on the surface of ricin A-chain in its natural 

35 conformation, and wherein the analog retains ribosome- 
inactivating activity of the Type I ribosome-inactivating 
protein • 
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A method according to the present invention for 
preparing an agent toxic to a cell may include the step of 
linking an analog of a Type I ribosome-inactivating protein 
through a cysteine to a molecule which specifically binds 
5 to the cell, which analog has a cysteine available for 

intennolecular disulfide bonding located at an amino acid - 
position corresponding to a position not naturally 
available for intermolecular disulfide bonding in the Type 
I ribosome-inactivating protein and corresponding to a 
10 position on the surface of ricin A-chain in its natural 
conformation, and which analog retains ribosome- 
inactivating activity of the Type I ribosome-inactivating 
protein. 

A method according to the present invention for 

15 treating a disease in which elimination of particular cells 

is a goal includes the step of administering to a patient ~ 
having the disease a therapeutically effective amount of an 
agent toxic to the cells wherein the agent includes a type 
I RIP fused to or an analog of a Type I ribosome- 

20 inactivating protein linked by a disulfide bond through a 

cysteine to a molecule which specifically binds to the 
cell, which analog has a cysteine available for 
intermolecular disulfide bonding located at an amino acid 
position corresponding to a position not naturally 

25 available for intermolecular disulfide bonding in the Type 
I ribosome-inactivating protein and corresponding to a 
position on the surface of ricin A-chain in its natural 
conformation, and which analog retains ribosome- 
inactivating activity of the Type I ribosome-inactivating 

30 protein. 

The RIP analogs of the invention are particularly 
suited for use as components of cytotoxic therapeutic 
agents* Cytotoxic agents according to the present 
invention may be used in vivo to selectively eliminate any 
35 cell type to which the RIP component is targeted by the 
specific binding capacity of the second component. To form 
cytotoxic agents, RIP analogs may be conjugated to 
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monoclonal antibodies, including chimeric and CDR-grafted 
antibodies, and antibody domains /fragments (e.g., Fab, 
Fab 1 , FCab'h, single chain antibodies, and Fv or single 
variable domains) . Analogs of RlPs conjugated to 
5 monoclonal antibodies genetically engineered to include 
free cysteine residues are also within the scope of the 
present invention. Examples of Fab' and F(ab , ) 2 fragments 
useful in the present invention are described in co- 
pending, co-owned U.S. Patent Application Serial No, 

10 07/714,175, filed June 14, 1991 and in International 

Publication No. WO 89/00999 published on February 9, 1989, 
which are incorporated by reference herein. 

The RIP analogs of the invention may preferably 
be conjugated or fused to humanized or human engineered 

15 antibodies, such as the he3 antibody described herein. 

Such humanized antibodies may be constructed from mouse - 
antibody variable domains, such as the mouse antibody H65 
(SEQ ID NOS: 123 and 124). Specifically RIP analogs 
according to the present invention may be conjugated or 

20 fused to he3 human-engineered antibody light and heavy 

chain variable regions (SEQ 10 NO: 125 and 126, 
respectively) or fragments thereof. A cell line producing 
an intact he3 immunoglobulin was deposited as ATCC 
Accession No. HB11206 with the American Type Culture 

25 Collection, 12301 Parklawn Drive, Rockville, Maryland 

20852. 

RIPs according to the present invention may also 
be conjugated to targeting agents other than antibodies, 
for example, lectins which bind to cells having particular 

30 surface carbohydrates, hormones, lymphokines , growth 
factors or other polypeptides which bind specifically to 
cells having particular receptors. Immunoconjugates 
including RIPs may be described as immunotoxins . An 
immunotoxin may also consist of a fusion protein rather 

35 than an immunoconjugate. 

The present invention provides gene fusions of an 
antigen-binding portion of an antibody (e.g., an antibody 
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light chain or truncated heavy chain, or a single chain 
antibody) or any targeting agent listed in the foregoing 
paragraph, linked to a Type I RIP. Preferably, the 
antigen-binding portion of an antibody or fragment thereof 
5 comprises a single chain antibody, a Fab fragment, or a 

F(ab') 2 fragment. Active antibodies generally have a 
conserved three-dimensional folding pattern and it is 
expected that any antibody which maintains that folding 
pattern will retain binding specificity. Such antibodies 
10 are expected to retain target enzymatic activity when 

incorporated into a fusion protein according to the present 
invention. 

It is sometimes necessary that immunotoxins 
comprising cytotoxic components, such as RIPs, be attached 
15 to targeting agents via cleavable linkers (i.e., 

disulfides, acid-sensitive linkers, and the like) in order 
to allow intracellular release of the cytotoxic component. 
Such intracellular release allows the cytotoxic component 
to function unhindered by possible negative kinetic or 

2 0 steric effects of the attached antibody. Accordingly, gene 

fusions of the present invention may comprise a RIP gene 
fused, via a DNA segment encoding a linker protein as 
described above, to either the 5' or the 3' end of a gene 
encoding an antibody. If a linker is used, it preferably 

25 encodes a polypeptide which contains two cysteine residues 

participating in a disulfide bond and forming a loop which 
includes a protease-sensitive amino acid sequence (e.g., a 
segment of E. coli shiga-like toxin as in SEQ ID NO: 56) or 
a segment which contains several cathepsin cleavage sites 

30 (e.g., a segment of rabbit muscle aldolase as in SEQ ID NO: 

57; a segment of human muscle aldolase; or a synthetic 
peptide including a cathepsin cleavage sites such as in SEQ 
ID NOs: 141 or 142) . A linker comprising cathepsin 
cleavage sites as exemplified herein comprises the C- 

3 5 terminal 20 amino acids of RMA. However, that sequence 

differs by only one amino acid from human muscle aldolase 
and it is contemplated that muscle aldolase from human or 
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other sources may be used as a linker in the manner 
described below. The Type I RIP portion of the fused genes 
preferably encodes gelonin, BRIP or momordin II. Also 
preferably, the antibody portion of the fused genes 
5 comprises sequences encoding one of the chains of an 

antibody Fab fragment (i.e., kappa or Fd) and the fused 
gene is co-expressed in a host cell with the other Fab 
gene, or the antibody portion comprises sequences encoding 
a single chain antibody. 

10 Alternatively, since fusion proteins of the 

present invention may be of low (approximately 55 kDa) 
molecular weight while maintaining full enzymatic activity, 
such fusions may be constructed without a linker and still 
possess cytotoxic activity. Such low-molecular weight 

15 fusions are not as susceptible to kinetic and steric 

hinderance as are the larger fusions, such as fusions - 
involving IgG molecules. Therefore, cleavage of the RIP 
away from the fusion may not be necessary to facilitate 
activity of the RIP. 

20 The present invention also provides a method for 

purifying a protein or immunotoxin comprising a ribosome- 
inactivating protein and a portion of an antibody including 
the steps of passing a solution containing the protein 
through an anion exchange column; applying the flow-through 

25 to a protein G column; and eluting the protein from the 
protein G column. The method may further comprise the 
steps of introducing the flow-through of the anion exchange 
column into a cation exchange column; exposing the cation 
exchange column to an eluent effective to elute said 

30 protein; and then applying the eluted protein to a protein 

G column, rather than applying the anion exchange column 
flow- through directly to a protein G column. 

Immunotoxins according to the present invention, 
including immunoconjugates and fusion proteins 

35 (immunofusions) , are suited for treatment of diseases where 

the elimination of a particular cell type is a goal, such 
as autoimmune disease, cancer, and graf t-versus-host 
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disease. The immunotoxins are also suited for use in 
causing immunosuppression and in treatment of infections by 
viruses such as the Human Immunodeficiency Virus* 

Specifically illustrating polynucleotide 
5 sequences according to the present invention are the 

inserts in the plasmid pING3731 in E. coli MC1061 
(designated strain G274) and in the plasmid pING3803 in E. 
coli £104 (designated strain G275) , both deposited with the 
American Type Culture Collection (ATCC) , 12301 Parklawn 

10 Drive , Rockville, Maryland, on October 2, 1991 , and 
assigned ATCC Accession Nos. 68721 and 68722, respectively. 
Additional polynucleotide sequences illustrating the 
invention are the inserts in the plasmid pING3746 in E. 
coli £104 (designated strain G277) and in the plasmid 

15 pING3737 in E. coli E104 (designated strain G276) , which 

were both deposited with the ATCC on June 9, 1992, and were - 
respectively assigned Accession Nos. 69008 and 69009. 
Still other polynucleotide sequences illustrating the 
invention are the inserts in the plasmid pING3747 in E. 

20 coli £104 (designated strain G273), in the plasmid pING3754 

in E. coli £104 (designated strain G279) , in the plasmid 
pING3758 in E. coli £104 (designated strain G280) and in 
the plasmid pING3759 in E. coli E104 (designated strain 
G281) , which plasmids were all deposited with the ATCC on 

25 October 27, 1992 and were assigned ATCC Accession Nos. 

69101, 69102, 69103 and 69104, respectively. 

As noted above, RIPs may preferably be conjugated 
or fused to humanized or human-engineered antibodies, such 
as he3. Thus f the present invention also provides novel 

30 proteins comprising a humanized antibody variable domain 
which is specifically reactive with an human CD5 cell 
differentiation marker. Specifically, the present 

invention provides proteins comprising the he3 light and 
heavy chain variable regions as shown in SEQ ID NOS: 125 or 

35 126, respectively. DNA encoding certain he3 proteins is 
shown in SEQ ID NOS: 72 and 71. 
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In a preferred embodiment of the present- 
invention, the protein comprising an humanized antibody 
variable region is an intact he3 immunoglobulin deposited 
as ATCC HB 11206. 
5 Also in a preferred embodiment of the invention, 

the protein comprising a humanized antibody variable 
region is a Fab or F(ab'>2 or Fab fragment. 

Proteins according to the present invention may 
be made by methods taught herein and in co-pending, co- 

10 owned U.S. patent application no. 07/808,464 by studnicka 

et al. incorporated by reference herein; and modified 
antibody variable domains made by such methods may be used 
in therapeutic administration to humans either alone or as 
part of an immunocon jugate as taught in co-owned, co- 

15 pending U.S. Patent Application No. 07/787,567 by Better et 

al. 

The present invention also provides methods for 
preparing a modified antibody variable domain useful in 
preparing immunotoxins and immunofusions by determining the 
20 amino acids of a subject antibody variable domain which may 
be modified without diminishing the native affinity of the 
domain for antigen while reducing its immunogenic ity with 
respect to a heterologous species. As used herein, the 
term "subject antibody variable domain" refers to the 
25 antibody upon which determinations are made. The method 

includes the following steps: determining the amino acid 
sequence of a subject light chain and a subject heavy chain 
of a subject antibody variable domain to be modified; 
aligning by homology the subject light and heavy chains 
30 with a plurality of human light and heavy chain amino acid 
sequences; identifying .the amino acids in the subject light 
and heavy chain sequences which are least likely to 
diminish the native affinity of the subject variable domain 
for antigen while, at the same time, reducing its 
35 immunogenicity by selecting each amino acid which is not in 

an interface region of the subject antibody variable domain 
and which is not in a complementarity-determining region or 
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in an antigen-binding region of the subject antibody 

variable domain, but which amino acid is in a position 

exposed to a solvent containing the antibody; changing each 

residue identified above which aligns with a highly or a 
5 moderately conserved residue in the plurality of human 

light and heavy chain amino acid sequences if said ♦ 

identified amino acid is different from the amino acid in 

the plurality. 

Another group of sequences, such as those in 
10 Figures 1A and IB may be used to determine an alignment 

from which the skilled artisan may determine appropriate 

changes to make* 

The present invention provides a further method 

wherein the plurality of human light and heavy chain amino 
15 acid sequences is selected from the human consensus 

sequences in Figures 10A and 10B. 

In general, human engineering according to the 

above methods may be used to treat various diseases against 

which monoclonal antibodies generally may be effective. 
20 However, humanized antibodies possess the additional 

advantage of reducing the immunogenic response in the 

treated patient. 

Additional aspects and applications of the 

present invention will become apparent to the skilled 
25 artisan upon consideration of the detailed description of 

the invention which follows. 



BRICT DgflCRIPTIOH 07 THB DRAWINGS 

FIG. 1 is a computer-generated alignment of the 
amino acid sequence of the ricin A-chain (RTA) (SEQ ID NO: 
3 0 1) with the amino acid sequence of the Type I ribosome- 

inactivating protein gelonin (SEQ ID NO: 2), wherein 
starred positions indicate amino acids invariant among the 
ricin A-chain and the Type I RIPs; 

FIG. 2 is a computer-generated alignment of the 
35 amino acid sequence of the ricin A-chain (SEQ ID NO: 1) 
with the amino acid sequence of the Type I ribosome- 
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inactivating protein BRIP (SEQ ID NO: 3), wherein starred 
positions indicate amino acids invariant among the ricin A- 
chain and the Type I RIPs; 

FIG. 3 is a computer-generated alignment of the 
5 amino acid sequence of the ricin A-chain (SEQ ID NO: i) 

with the amino acid sequence of the Type I ribosome- 
inactivating protein momordin II (MOMOII) (SEQ ID NO: 4) , 
wherein starred positions indicate amino acids invariant 
among the ricin A-chain and the Type I RIPs; 

10 FIG. 4 is a computer-generated alignment of the 

amino acid sequence of the ricin A-chain (SEQ ID NO: 1) 
with the amino acid sequence of the Type I ribosome- 
inactivating protein luff in (SEQ ID NO: 5) , wherein starred 
positions indicate amino acids invariant among the ricin A- 

15 chain and the Type I RIPs; 

FIG. 5 is a computer-generated alignment of the . 
amino acid sequence of the ricin A-chain (SEQ ID NO: l) 
with the amino acid sequence of the Type I ribosome- 
inactivating protein atrichosanthin (TRICHO) (SEQ ID NO: 

20 6) , wherein starred positions indicate amino acids 

invariant among the ricin A-chain and the Type I RIPs; 

FIG. 6 is a computer-generated alignment of the 
amino acid sequence of the ricin A-chain (SEQ ID NO: 1) 
with the amino acid sequence of the Type I ribosome- 

25 inactivating protein momordin I (MOMOI) (SEQ ID NO: 7) , 

wherein starred positions indicate amino acids invariant 
among the ricin A-chain and the Type I RIPs; 

FIG. 7 is a computer-generated alignment of the 
amino acid sequence of the ricin A-chain (SEQ ID NO: 1) 

30 with the amino acid sequence of the Type I ribosome- 
inactivating protein Mirabills anti-viral protein (MAP) 
(SEQ ID NO: 8), wherein starred positions indicate amino 
acids invariant among the ricin A-chain and the Type I 
RIPs; 

35 FIG. 8 is a computer-generated alignment of the 

amino acid sequence of the ricin A-chain (SEQ ID NO: 1) 
with the amino acid sequence of the Type I ribosome- 
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inactivating protein pokeweed antiviral protein from seeds (PAPS) 
(SEQ ID NO: 9), wherein starred positions indicate amino acids 
5 invariant among the ricin A-chain and the Type I RIPs; 

FIG, 9 is a computer-generated alignment of the amino 
acid sequence of the ricin A-chain (SEQ ID NO: 1) with the amino 
acid sequence of the Type I ribo some inactivating protein saporin 
6 (SAP6) (SEQ ID NO: 10) , wherein starred positions indicate 
10 amino acids invariant among the ricin A-chain and the Type I 
RIPs; 

FIGs. 10A and 10B are alignments of the consensus 
amino acid sequences for the subgroups of light [hKl (SEQ ID NO: 
149) (human kappa light chain subgroup 1) , hK3 (SEQ ID NO: 150) 

15 (human kappa light chain subgroup 3), hK2 (SEQ ID NO: 151) (human 
kappa light chain subgroup 2), hLl (SEQ ID NO: 152) (human lambda 
light chain subgroup 1) , hL2 (SEQ ID NO: 153) (human lambda 
light chain subgroup 2), hL3 (SEQ ID NO: 154) (human lambda light 
chain subgroup 3) , hL6 (SEQ ID NO: 155) (human lambda light chain 

20 subgroup 6) , hK4 (SEQ ID NO: 156) (human kappa light chain 
subgroup 4), hL4 (SEQ ID NO: 157) (human lambda light chain 
subgroup 4) and hL5 (SEQ ID NO: 158) (human lambda light chain 
subgroup 5)] and heavy chains [hH3 (SEQ ID NO: 159) (human heavy 
chain subgroup 3), hHl (SEQ ID NO: 160) (human heavy chain 

25 subgroup 1) and hH2 (SEQ ID NO: 161) (human heavy chain subgroup 
2)], respectively, of human antibody variable domains; 

FIGs. 11A and 11B set out the nucleotide sequences of 
the oligonucleotides utilized in the construction of the genes 
encoding modified V/J-regions of the light and heavy chains of 

30 the H65 mouse monoclonal antibody variable domain sequence $H65K- 
1:SEQ ID No. 117; HUH-K1:SEQ ID No. 141; HUH-K2:SEQ ID No. 142; 
HUH-K3:SEQ ID No. 143; HUH-K4:SEQ ID No. 121; HUH-K5:SEQ ID No. 
122; HUH-G1:SEQ ID No. 144; HUH-G2:SEQ ID No. 145; HUH-G3:SEQ ID 
No. 137; HUH-G4:SEQ ID No. 138; HUH-G5:SEQ ID No. 139; HDH-G6 : SEQ 

35 ID No. 140; H65G-2S:SEQ ID No. 146; H65-G2:SEQ ID No. 85; H65K- 
2S:SEQ ID No. 116; JKl-Hindlll : SEQ ID No. 87; and 

FIGs. 12A and 12B are alignments of human light chain 
consensus hKl (SEQ ID No. 149) and heavy chain consensus hHl (SEQ 
ID No. 160) with the light and heavy chain sequences, 

40 respectively, of the variable domain of human antibody EU (SEQ ID 
Nos. 162 and 166), human antibody TAG (SEQ ID Nos. 163 and 167), 
human antibody TAC modified according to the present invention 
(prop) (SEQ ID Nos. 164 and 168) and human antibody TAC modified 
according to a different method (Que) (SEQ ID Nos. 165 and 169) . 
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DETATIiED DBSCRIPTip M 
Nucleotide sequences of genes encoding three 
plant Type I RIPs and expression vectors containing the 
genes are provided by the present invention. A first plant 
5 RIP, gelonin, is produced by seeds of Gelonium multiflozvm, 

a plant of the Euphorbiaceae family native to the tropical 
forests of eastern Asia, while a second plant RIP, BRIP, is 
synthesized by the common cereal grain barley. Mom or din 
II, a third plant RIP, is produced in Momordica balsamlna 

10 seeds* Analogs of BRIP are also provided by the present 
invention. The analogs were genetically engineered to 
include a cysteine free to participate in a intermolecular 
disulfide bond and were conjugated to antibody molecules 
without non-specific chemical derivatization of the RIP 

15 with crosslinking agents. 

Type I RIP analogs of the present invention offer - 
distinct advantages over the natural proteins for use as 
components of immunotoxins . Chemical treatment to 
introduce free sulfhydryl groups in the natural proteins 

20 lacking free cysteines typically involves the non-selective 

modification of amino acid side chains. This non- 
selectivity often results in antibodies conjugated to 
different sites on different RIP molecules (i.e., a 
heterogeneous population of conjugates) and also in a 

25 decrease in RIP activity if antibodies are conjugated in or 

near important regions of the RIP (e.g., the active site or 
regions involved in translocation across cell membranes) . 
In contrast, RIP analogs according to the present invention 
may be conjugated to a single antibody through a disulfide 

30 bond to a specific residue of the analog resulting in 
reduced batch to batch variation of the immunoconjugates 
and, in some cases, immunoconjugates with enhanced 
properties (e.g., greater cytotoxicity or solubility). 

Type I plant RIPs, as well as bacterial RIPs such 

35 as Shiga and shiga-like toxin A-chains, are homologous to 

the ricin A-chain and are useful in the present invention. 
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Type I RlPs may be defined and sites for 
substitution of a cysteine in a RIP may be identified by 
comparing the primary amino acid sequence of the RIP to the 
natural ricin A-chain amino acid sequence, the tertiary 
5 structure of which has been described in Katzin et al., 
Proteins, 20:251-259 (1991) , which is incorporated by 
reference herein. 

Amino acid sequence alignment defines Type I RIPs 
in that the ricin A-chain and the Type I plant RIPs have 

10 nine invariant amino acids in common. Based on the ricin 

sequence the invariant amino acids are tyrosine 21/ 
arginine 29 , tyros ine 80 , tyrosine 123 , leucine 144/ glutamic 
acid 177 , alanine 17B , arginine 180 , and tryptophan 211 . The ricin 
A-chain may be used as a model for the three-dimensional 

15 structure of Type I RIPs. A protein lacking a cysteine 
available for conjugation while having ribosome- 
inactivating activity and having the nine invariant amino 
acids when its primary sequence is compared to the primary 
sequence of the ricin A-chain [according to the alignment 

20 algorithm of Myers et al., CABIOS COMMUNICATIONS, 4(1) : 11- 

17 (1988) , implemented by the PC /GENE program PALIGN 
(Intelligenetics, Inc. , Mountain View, California) and 
utilizing the Oayhoff Mutation Data Matrix (MDM-78) as 
described in Schwartz et al., pp. 353-358 in Atlas of 

25 Protain Sequence and Structure, 5 Supp. 3, National 
Biomedical Research Foundation, Washington, D.C. (1978)] is 
defined as a Type I RIP herein and is expected to be useful 
in the present invention. "Corresponding" refers herein to 
amino acid positions which align when two amino acid 

30 sequences are compared by the strategy of Myers et al., 
supra . 

The primary amino acid sequences of the Type I 
RIPs:gelonin, BRIP, momordin II, luffin [see Islam et al., 
Agricultural Biological Chem., 54(5) : 1343-1345 (199) ] , 
35 atrichosanthin [see Chow et al., J. Biol. Chem., 265:8670- 
8674 (1990)], momordin I [see Ho et al., BBA, 1088:311-314 
(1991)], Mirabilis anti-viral protein [see HabuJca et al., 
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J. Biol* Chem., 264 (12) : 6629-6637 (1989)], pokeweed 
antiviral protein isolated from seeds [see Kung et al., 
Agrric. Biol. Chem., 54 (12) : 3301-3318 (1990)] and saporin 
[see Benatti et al., Eur. *7. Biochem. , 183:465-470 (1989)] 
5 are individually aligned with the primary sequence of the 

ricin A- chain [see Hailing et al., Nucleic Acids Res., 
13:8019-8033 (1985)] in FIGs 1-9, respectively, according 
to the algorithm of Myers et al., supra, as specified 
above. 

10 FIGS. 1-9 may be utilized to predict the amino 

acid positions of the Type I RIPs where cysteine residues 
may be substituted. Preferred amino acids for cysteine 
substitution are on the surface of the molecule and include 
any solvent accessible amino acids which will not interfere 

15 with proper folding of the protein if replaced with a 
cysteine. A region of the ricin A-chain comprising such, 
amino acids is the carboxyl terminal region. Amino acids 
that should be avoided for replacement are those critical 
for proper protein folding, such as proline, and those that 

20 are solvent inaccessible. Also to be avoided are the nine 
amino acids invariant among RIPs, and the amino acids in or 
near regions comprising the active site of ricin A-chain as 
depicted in Figure 6 of Katzin et al., supra. 

Therefore, a preferred region of substitution for 

25 Type I RIPs is their carboxyl terminal region which is 
solvent accessible and corresponds to the carboxyl terminal 
region where Type II RIP A-chains and B-chains are 
naturally linked by a disulfide bond. As shown in the 
examples, a cysteine may be substituted in positions in the 

30 amino acid sequence of a Type I RIP from the position 
corresponding to position 251 in SEQ ID NO: 1 to the 
carboxyl terminal position of said Type I RIP, resulting in 
RIP analogs which retain enzymatic activity and gain 
disulfide cross-linking capability. One preferred cysteine 

35 substitution position is near the position which 
corresponds to the cysteine at position 259 in the ricin A- 
chain. 
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For purposes of the present invention, 
immunotoxins comprise a class of compounds of which toxin- 
antibody fusions and immunoconjugates are examples. 
Immunotoxins are particularly suited for use in treatment 
5 of human autoimmune diseases and in the treatment of 
diseases in which depletion of a particular cell type is a 
goal, such as cancer. For example, treatment of autoimmune 
diseases with immunotoxins is described in International 
Publication No. WO89/06963 published August 10, 1989, which 

10 is incorporated by reference herein. 

In any treatment regimen, the immunotoxins may be 
administered to a patient either singly or in a cocktail 
containing two or more immunotoxins, other therapeutic 
agents, compositions, or the like, including, but not 

15 limited to, immunosuppressive agents, tolerance- inducing 

agents, potentiators and side-effect relieving agents. - 
Particularly preferred are immunosuppressive agents useful 
in suppressing allergic reactions of a host. Preferred 
immunosuppressive agents include prednisone, prednisolone, 

20 DECADRON (Merck, Sharp & Dohme, West Point, Pennsylvania) , 

cyclophosphamide, cyclosporine, 6-mercaptopurine, 
methotrexate, azathioprine and i.v. gamma globulin or their 
combination. Preferred potentiators include monensin, 
ammonium chloride, per hexi line, verapamil, amantadine and 

25 chloroguine. All of these agents are administered in 

generally-accepted efficacious dose ranges such as those 
disclosed in the Physician's Desk Reference, 41st Ed., 
Publisher Edward R. Barnhart, New Jersey (1987). Patent 
Cooperation Treaty (PCT) patent application wo 89/069767 

30 published on August 10, 1989, discloses administration of 
an immunotoxin as ^an immunosuppressive agent and is 
incorporated by reference herein. 

Immunotoxins of the present invention may be 
formulated into either an injectable or topical 

35 preparation. Parenteral formulations are known and are 
suitable for use in the invention, preferably for 
intramuscular or intravenous administration. The 
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formulations containing therapeutically-ef f ective amounts 
of immunotoxins are either sterile liquid solutions, liquid 
suspensions, or lyophilized versions, and optionally 
contain stabilizers or excipients. Lyophilized 
5 compositions are reconstituted with suitable diluents, 

e.g., water for injection, saline, 0.3% glycine and the 
like, at a level of about from 0.01 mg/kg of host body 
weight to 10 mg/kg where the biological activity is less 
than or equal to 20 ng/ml when measured in a reticulocyte 

10 lysate assay* Typically, the pharmaceutical compositions 

containing immunotoxins of the present invention are 
administered in a therapeutically effective dose in a range 
of from about 0.01 mg/kg to about 5 mg/kg of the patient. 
A preferred, therapeutically effective dose of the 

15 pharmaceutical composition containing immunotoxins of the 

invention is in a range of from about 0.01 mg/kg to about- 
0.5 mg/kg body weight of the patient administered over 
several days to two weeks by daily intravenous infusion, 
each given over a one hour period, in a sequential patient 

20 dose-escalation regimen. 

Immunotoxin compositions according to the 
invention may be formulated into topical preparations for 
local therapy by including a therapeutically effective 
concentration of immunotoxin in a dermatological vehicle. 

25 The amount of immunotoxin to be administered, and the 
immunotoxin concentration in the topical formulations, 
depend upon the vehicle selected, the clinical condition of 
the patient, the systemic toxicity and the stability of the 
immunotoxin in the formulation. Thus, a physician knows to 

30 employ the appropriate preparation containing the 

appropriate concentration of immunotoxin in the 
formulation, as well as the appropriate amount of 
formulation to administer depending upon clinical 
experience with the patient in question or with similar 

35 patients. The concentration of immunotoxin for topical 
formulations is in the range of greater than from about 0.1 
mg/ml to about 25 mg/ml. Typically, the concentration of 
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immunotoxin for topical formulations is in the range of 
greater than from about 1 mg/ml to about 20 mg/ml. Solid 
dispersions of immunotoxins according to the invention, as 
well as solubilized preparations , may be used. Thus, the 
5 precise concentration to be used in the vehicle is subject 

to modest experimental manipulation in order to optimize 
the therapeutic response. For example, greater than about 
10 mg immunotoxin/100 grams of vehicle may be useful with 
1% w/w hydrogel vehicles in the treatment of skin 

10 inflammation. Suitable vehicles, in addition to gels, are 

oil-in-water or water- in-oil emulsions using mineral oils, 
petroleum and the like* 

Immunotoxins according to the invention may be 
optionally administered topically by the use of a 

15 transdermal therapeutic system [Barry , D&rmatological 

Formulations, p. 181 (1983) and literature cited therein]. * 
While such topical delivery systems may be designed for 
transdermal administration of low molecular weight drugs, 
they are capable of percutaneous delivery. Further, such 

20 systems may be readily adapted to administration of 

immunotoxin or derivatives thereof and associated 
therapeutic proteins by appropriate selection of the rate- 
controlling microporous membrane. 

Topical preparations of immunotoxin either for 

25 systemic or local delivery may be employed and may contain 

excipients as described above for parenteral administration 
and other excipients used in a topical preparation such as 
cosol vents, surfactants, oils, humectants, emollients, 
preservatives, stabilizers and antioxidants. 

30 Pharmacologically-acceptable buffers may be used, e.g., 

Tris or phosphate buffers. The topical formulations may 
also optionally include one or more agents variously termed 
enhancers, surfactants, accelerants, adsorption promoters 
or penetration enhancers, such as an agent for enhancing 

35 percutaneous penetration of the immunotoxin or other 

agents. Such agents should desirably possess some or all 
of the following features as would be known to the 
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ordinarily skilled artisan: pharmacological inertness, 
non-promotive of body fluid or electrolyte loss, compatible 
with immunotoxin (non- inactivating) , and capable of 
formulation into creams, gels or other topical delivery 
5 systems as desired. 

Immunotoxins according to the present invention 
may also be administered by aerosol to achieve localized 
delivery to the lungs. This is accomplished by preparing 
an aqueous aerosol , liposomal preparation or solid 

10 particles containing immunotoxin. Ordinarily, an aqueous 

aerosol is made by formulating an aqueous solution or 
suspension of immunotoxin together with conventional 
pharmaceutical^ acceptable carriers and stabilizers. The 
carriers and stabilizers vary depending upon the 

15 requirements for the particular immunotoxin, but typically 

include: nonionic surfactants (Tweens, Pluronics, or 
polyethylene glycol) ; innocuous proteins like serum 
albumin, sorbitan esters, oleic acid, lecithin; amino acids 
such as glycine; and buffers, salts, sugars or sugar 

20 alcohols. The formulations may also include mucolytic 

agents as well as bronchodilating agents. The formulations 
are sterile. Aerosols generally are prepared from isotonic 
solutions. The particles optionally include normal lung 
surfactants • 

25 Alternatively, immunotoxins of the invention may 

be administered orally by delivery systems such as 
proteinoid encapsulation as described by Steiner, et al., 
U.S. Patent No. 4,925,673, incorporated by reference 
herein. Typically, a tfaerapeutically-ef f ective oral dose 

30 of an immunotoxin according to the invention is in the 
range from about 0.05 mg/kg body weight to about 50 mg/kg 
body weight per day. A preferred effective dose is in the 
range from about 0.05 mg/kg body weight to about 5 mg/kg 
body weight per day. 

35 Immunotoxins according to the present invention 

may be administered systemically, rather than topically, by 
injection intramuscularly, subcutaneous ly, intrathecally or 
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intraperitoneally or into vascular spaces, particularly 
into the joints, e.g., intraarticular injection at a dosage 
of greater than about 1 Mg/cc joint fluid/day. The dose 
will be dependent upon the properties of the specific 
5 immunotoxin employed, e.gr., its activity and biological 

half -life, the concentration of immunotoxin in the 
formulation, the site and rate of dosage, the clinical 
tolerance of the patient involved, the disease afflicting 
the patient and the like, as is well within the skill of 

10 the physician. 

The immunotoxins of the present invention may be 
administered in solution. The pH of the solution should be 
in the range of pH 5 to 9.5, preferably pH 6.5 to 7.5. The 
immunotoxin or derivatives thereof should be in a solution 

15 having a suitable pharmaceutically-acceptable buffer such 

as phosphate, Tris(hydroxymethyl)aminomethane-HCl or 
citrate and the like. Buffer concentrations should be in 
the range of 1 to 100 mM. The immunotoxin solution may 
also contain a salt, such as sodium chloride or potassium 

20 chloride in a concentration of 50 to 150 mM. An effective 

amount of a stabilizing agent such as an albumin, a 
globulin, a gelatin, a protamine or a salt of protamine may 
also be included, and may be added to a solution containing 
immunotoxin or to the composition from which the solution 

25 is prepared. 

Systemic administration of immunotoxin may be 
made daily and is generally by intramuscular injection, 
although intravascular infusion is acceptable. 
Administration may also be intranasal or by other 

30 nonparenteral routes. Immunotoxins of the present 
invention may also be administered via microspheres, 
liposomes or other micr opart iculate delivery systems placed 
in certain tissues including blood. Topical preparations 
are applied daily directly to the skin or mucosa and are 

35 then preferably occluded, i.e., protected by overlaying a 
bandage, polyolefin film or other barrier impermeable to 
the topical preparation. 
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The following Examples are illustrative of 
practice of the invention but are not to be construed as 
limiting the invention. The present application is broadly 
organized as follows. The first portion of the application 
5 broadly teaches the preparation, expression and properties 

of an exemplary RIP, gelonin. A second portion of the 
application teaches the preparation of human-engineered 
antibodies. A third portion of the application teaches the 
construction and properties of immunoconjugates, comprising 

10 an RIP and an antibody or fragment thereof comprising an 

antigen-binding portion, A forth portion of the 

application relates to the preparation and properties of 
immunofusion proteins comprising an RIP and an antibody or 
fragment thereof comprising an antigen-binding portion. A 

15 fifth portion of the application teaches the preparation 

and properties of the RIP Barley ribosome-inactivating 
protein and a final aspect of the invention provides the 
preparation and properties of the RIP momordin. 

Specifically, Example 1 relates to the 

20 preparation of the RIP gelonin. Construction of expression 
vector, comprising the gelonin gene, including expression 
and purification of gelonin, is taught in Example 2. The 
assembly of gelonin genes with cysteine residues available 
for conjugation is taught in Example 3 and Example 4 

25 provides results of a reticulocyte lysate assay performed 
on gelonin. 

Example 5 teaches the construction of human- 
engineered antibodies for use in immunotoxins of the 
invention and Example 6 demonstrates transfection of he3 

30 genes, expression of those genes, and purification of the 
products thereof. 

Example 7 next teaches the preparation of gelonin 
immunoconjugates. The procedures and results of whole cell 
kill assays are next presented in Example 8. Various 

3 5 properties of gelonin immunoconjugates are taught in 

Example 9 and Examples 10 and 11 teach the pharmacokinetics 
of two types of immunoconjugates. Examples 12 and 13 teach 
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the immunogenicity of immunoconjugates of the invention and 
the in vivo efficacy of those immunoconjugates , 
respectively. 

The construction of genes encoding gelonin 
5 immunofusions is taught in Examples 14, 15, 16, 17 and 13. 

Example 19 teaches alternative cathepsin cleavable linkers 
for use itt the immunofusions of the invention. The 
expression and purification of various genes encoding 
immunoconjugates are presented in Example 20 and their 
10 activity properties are presented in. Example 21. 

The construction of genes encoding the RIP , BRIP, 
and its expression and properties are- taught in Examples 
22 , 23 , and 24. 

Finally, construction of genes encoding momordin 
15 and properties of-- momordin on expression are taught in 

Example 25* 

Example 3, 

Preparation Of Gelonin 

The cloning of the gelonin gene according to the 

20 present invention obviates the requirement of purifying the 

RIP gene product from its relatively scarce natural source, 
G. multiflonim seeds. Cloning also allows development of 
gelonin analogs which may be conjugated to antibodies 
without prior chemical derivatization and also allows 

25 development of gelonin gene fusion products. 

a. preparation ot *WA fro* g. Muittfiginm gegdg 

Total RNA was prepared from Gel on i urn seeds (Dr. 
Michael Rosenblum, M.D. Anderson Cancer Center, Houston, 
Texas) by a modification of the procedure for preparation 

30 of plant RNA described in Ausubel et al., eds., Current 

Protocols in Molecular Biology, Wiley & Sons, 1989. 
Briefly, 4.0 grams of seeds were ground to a fine powder in 
a pre-cooled (-70 *C) mortar and pestle with liquid K 2 . The 
powder was added to 25 ml Grinding buffer (0.18M Tris, 

35 0.09M LiCl, 4.5mM EDTA, 1% SDS, pH 8.2) along with 8.5 ml 
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of phenol equilibrated with TLE (0.2K Tris, 0.1M LiCl, 5mM 
EDTA pH8.2). The mixture was homogenized using a Polytron 
PT-1035 (#5 setting). 8.5 ml of chloroform was added, 
mixed and incubated at 50 *C for 20 minutes. The mixture 
5 was centrifuged at 3000 g for 20 minutes in a rotor 

precooled to 4*C and the aqueous phase was transferred to 
a new tube. 8.5 ml of phenol was added followed by 8.5 ml 
of chloroform and the mixture was recentrifuged. This 
extraction was repeated 3 times. The RNA in the aqueous 

10 phase was then precipitated by adding 1/3 volume 8M LiCl, 

and incubated at 4*C for 16 hours. Next, the RNA was 
pelleted by centrifugation for 20 minutes at 4*C. The 
pellet was washed with 5 ml of 2M LiCl, recentrifuged and 
resuspended in 2 ml of water. The RNA was precipitated by 

15 addition of NaOAc to 0.3M and 2 volumes of ethanol. The 

RNA was stored in 70% ethanol at -70 'C. 

B. cDNA Preparation 

cONA was prepared from total Gelonium RNA by two 
methods. The first method involved making a cONA library 

20 in the bacterial expression plasmid pcDNAII using the 

Librarian IX cDNA Library Construction System kit 
(Invitrogan) . Approximately 5 ug of total RNA was 
converted to first strand cDNA with a 1:1 mixture of random 
primers and oligo-dT. Second strand synthesis with DNA 

25 polymerase X was performed as described by the system 

manufacturer. Double stranded cONA was ligated to BstXl 
linkers and size fractionated ♦ Pieces larger than about 
500 bp were ligated into the expression vector provided in 
the kit. Individual vectors were introduced into E. coli 

30 either by transformation into high-efficiency competent 

cells or by electroporation into electrocompetent cells. 
Electroporation was performed with a BTX100 unit (BTX, San 
Diego, CA) in 0.56m Flatpack cells as recommended by BTX 
based on the method of Dower et a!.. Nucleic Acids Res., 

35 15:6127-6145 (1988), at a voltage amplitude of 850 V and a 
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pulse length of 5 mS. The resulting library consisted of 
approximately 150,000 colonies. 

The second method involved generating cDNA using 
the RNA-PCR kit sold by Perkin-Elmer-Cetus. About 100 ng 
of total Gelonium RNA was used as template for cDNA 
synthesis. 



c - Determination Of The Gelonin Protein Sequence 

The partial sequence of the native gelonin 
protein was determined by direct amino acid sequence 
analysis using automated Edman degradation as recommended 
by the manufacturer using an Applied Biosystems model 4 7 OA 
protein sequencer. Proteolytic peptide fragments of 
gelonin (isolated from the same batch of seeds as the total 
RNA) were sequenced. 



O. Cloning Of The Gelonin Gene 

Three overlapping gelonin cDNA fragments were 
cloned and a composite gelonin gene was assembled from the 
three fragments. 



1. Cloning Of The Fragment Encoding The Middle 
20 Amino Acids Of Ge lonin In Vector nING3823 

Degenerate DNA primers based on the gelonin 

partial amino acid sequences were used to PCR- amplify 

segments of the cDNA generated with Perkin-Elmer-Cetus kit. 

Six primers were designed based on regions of the gelonin 

25 amino acid sequence where degeneracy of the primers could 

be minimized. Appropriate pairs of primers were tested for 
amplification of a gelonin gene fragment. Products of the 
expected DNA size were identified as ethidium bromide- 
stained DNA bands on agarose gels that DNA was treated with 

30 T4 DNA polymerase and then purified from an agarose gel. 

Only the primer pair consisting of primers designated gelo- 
7 and gelo-5 yielded a relatively pure product of the 
expected size. The sequences of degenerate primers gelo-7 
and gelo-5 are set out below using IUPAC nucleotide 

35 symbols. 
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Gelo-7 (SEQ ID NO: 14) 

5 f TTYAARGAYGCN CCNGA YGCNGCNT AYG ARGG 3' 
Gelo-5 (SEQ ID NO: 15) 

3 1 TTYTT YATRATRCANTGNCGNCANCTRGT Y CA 5 f 
5 Primer gelo-7 corresponds to amino acids 87-97 of gelonin 

while primer gelo-5 corresponds to amino acids 226-236. 
The blunt-ended DNA fragment (corresponding to amino acids 
87 to 236 of gelonin) generated with primers gelo-7 and 
gelo-5 was cloned into pUC18 (BRI*, Gaithersburg, Maryland) ♦ 

10 The DNA sequence of the insert was determined, and the 

deduced amino acid sequence based on the resulting DNA 
sequence matched the experimentally determined gelonin 
amino acid sequence. The clone containing this gelonin 
segment was denoted pING3726. 

15 The insert of clone pING3726 was labeled with 32 P 

and used as a probe to screen the 150, 000-member Gelonium 
cDNA library. Only one clone hybridized to the library 
plated in duplicate. This clone was purified from the 
library and its DNA sequence was determined. The clone 

20 contains a fragment encoding 185 of the 270 amino acids of 
gelonin (residues 25-209) and is denoted pING3823. 



2. Cloning Of The Fragment Encoding 

The N-Terminal Amino Acids Of Gelonin 

Based on the sequence determined for the gelonin 

25 gene segment in pING3726, exact oligonucleotide primers 

were designed as PCR amplification primers to be used in 

conjunction with a degenerate primer to amplify a 5 9 

gelonin gene fragment and with a nonspecific primer to 

amplify a 3* gelonin gene fragment. cDNA generated using 

30 the Perkin-Elmer-Cetus RNA-PCR kit was amplified. 

To amplify the 5 '-end of the gelonin gene, PCR 

amplification with a degenerate primer gelo-l and an exact 

primer gelo-10 was performed. The sequences of the primers 

are set out below. 
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Gelo-1 (SEQ ID NO: 16) 

5' GGNYTNGAYACNGTNWSNTTYWSNACNAARGG 3< 
Gelo-10 (SEQ ID NO: 17) 
3 t TGTCTGAACCCGTAACTTGGTAA 5 1 
5 Primer gelo-1 corresponds to amino acids l-ll of the 

gelonin gene while primer gelo-10 corresponds to amino 
acids 126-133. The product from the reaction was re- 
amplified with gelo-1 (SEQ ID NO: 16) and gelo-11 (an exact 
primer comprising sequences encoding amino acids 119-125 of 
10 gelonin) to confer specificity to the reaction product. 
The sequence of primer gelo-11 is listed below. 

Gelo-11 (SEQ ID NO: 18) 
3* CACTCTTCCGTATATCTCTCTGT 5* 
Hybridization with an internal probe confirmed that the 
15 desired specific gelonin DNA fragment was amplified. That 

fragment was cloned into pUC18 and the vector generated was 
designated pING3727. The fragment was sequenced , revealing 
that the region of the fragment (the first 27 nucleotides) 
corresponding to part of the degenerate primer gelo-l could 
20 not be translated to yield the amino acid sequence upon 
which primer gelo-l was originally based. This was not 
unexpected considering the degeneracy of the primer. The 
fragment was reamplif ied from the Ge Ionium cDNA with exact 
primers gelo-11 (SEQ ID NO: 18) and gelo-5 1 (which extends 
25 upstream of the 5' end of the gelonin gene in addition to 

encoding the first 16 amino acids of gelonin) . The 
sequence of primer gelo-5 9 is set out below. 

Gelo-5 9 (SEQ ID NO: 19) 
5 • TCAACCCGGGCTAGATACCGTGTCAT 
30 TCTCAACCAAAGGTGCCACTTATATTA 3 1 

The resulting DNA fragment encodes the first 125 amino 
acids of gelonin. While the majority of the sequence is 
identical to the natural gelonin gene, the first 32 
nucleotides of the DNA fragment may be different. For the 
35 purposes of this application this N- terminal fragment is 
referred to as fragment GEL1-125. 
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3 . Cloning Of The Fragment Encoding 

The C-Terminal Amino Acids Of ggl.rm;^ 

To amplify the 3' -end of the gelonin gene as well 

as 3 f untranslated sequences, PCR amplification with exact 

5 primers gelo-9 and XE-dT was performed. The sequence of 

each of the primers is set out below. 

Gelo-9 (SEQ ID NO: 20) 

5 1 CTTCATTTTGGCGGCACGTATCC 3' 

XE-dT (SEQ ID NO: 21) 

0 3* T TTTTTT T TTTT TT TT1UUTTAG 

GGTGCATTCGAACGTCGGAGCTC 5 1 

Primer gelo-9 corresponds to amino acids 107-113 of 

gelonin. Primer XE-dT consists of a 3« oligo-dT portion 

and a 5 1 portion containing the restriction sites HlndZXl 

5 and Xhol, and will prime any poly A-containing cDNA. The 

reaction product was reamplif led with exact primers gelo-8 

and XE. The sequences of primers gelo-8 and XE are set out 

below. 

Gelo-8 (SEQ ID NO: 22) 

0 5 1 CTCGCTGGAAGGTGAGAA 3 f 

XE (SEQ ID NO: 23) 
3 9 AGGGTGCATTCGAACGTCGGAGCTC 5' 
Primer gelo-8 consists of sequences encoding amino acids 
115-120 of gelonin while the primer XE corresponds to the 

5 5 • portion of the XE-dT primer which contains H±ndlXl and 

Xhol restriction sites. Hybridization with internal probes 
confirmed that the desired gelonin gene fragment was 
amplified. That fragment was then cloned into pUClB by two 
different methods. First , it was cloned as a blunt-ended 

0 fragment into the Sm&l site of pUClB (the resulting vector 

was designated pING3728) and, second, it was cloned as an 
EcoKL to i&ndlll fragment into pUC18 ( this vector was 
designated pING3729) . Both vector inserts were sequenced. 
The insert of pING3 728 encodes amino acids 114-270 of 

5 gelonin, while the insert of pING3729 encodes amino acids 

184-270 of gelonin plus other 3 ■ sequences. 
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4. Assembly Of The Overlapping Gelonin DNA 
Fragments Tnto A Composite no^ nnin Gan»_ 

To reassemble the C-terminal two-thirds of the 
gelonin gene, vector pING3729 was cut with Sspl (one Sspl 
site is located within the vector and the second is located 
about 80 bp downstream of the termination codon of the 
insert in the vector) and an Xhol linker (8 bp, New England 
Biolabs) was ligated to the resulting free ends. The DNA 
was then cut with Xhol and EcoRl, and the 350 bp fragment 
generated , encoding amino acids 185-270 of gelonin, was 
isolated. This 350 bp fragment was ligated adjacent to a 
tfcol to ScoRl fragment from pING3823 encoding amino acids 
37-185 of gelonin. , in a intermediate vector denoted 
PING3730, thus reassembling the terminal 87% of the gelonin 
15 gene (amino acids 37-270). 

Next, fragment GEL1-125 was cut with Smal and 
Ncol, resulting in a fragment encoding amino acids 1-36 of 
gelonin which was ligated along with the Ncol to xhol 
fragment of pING3730 into the vector picioo. [pIClOO is 
identical to pING1500 described in Better, et al., Science, 
240:1041-1043 (1988), incorporated by reference herein], 
except that it lacks 37 bp upstream of the pelB leader 
sequence. The 37 bp were eliminated by digestion of 
PING1500 with Sphl and tfcoRI, treatment with T4 polymerase, 
25 and religation of the vector. This manipulation 
regenerated an ScoRI site in the vector while eliminating 
other undesirable restriction sites.] Before ligation, the 
vector pIClOO had previously been digested with SstI, 
treated with T4 polymerase, and cut with Xhol. The 
30 ligation generated a new vector containing a complete 
gelonin gene which was designated plasmid pING373l and 
deposited with The American Type Culture Collection, 12301 
Parklawn Drive, Rockville, Maryland 20852 on October 2, 
1991 as Accession No. 68721. The complete DNA sequence of 
35 the gelonin gene is set out in SEQ ID NO: 11. 



20 



WO 94/26910 PCTA;S94/05348 



-39- 

gxaaple 2 

A. Construction Of Expression 

Vectors Containing The Gelonir^ r,ono 

A first E. coll expression vector was constructed 
5 containing the gelonin gene linked to the Erwtnia 

carotovora pelB leader sequence, and to the Salmon&lla 
typhlmurium araB promoter. A basic vector containing the 
araB promoter is described in co-owned U.S. Patent No. 
5,028,530 issued July 2, 1991 which is incorporated by 

10 reference herein. The vector containing the araB promoter 

was cut with ZcoRI and Xhol. Two DNA fragments were then 
ligated in tandem immediately downstream of the promoter. 
The fragment ligated adjacent to the promoter was a 131 bp 
fragment derived from SstI digestion, T4 polymerase 

15 treatment and digestion with EcoRl of the pIClOO vector 
which includes the leader sequence of the E. carotovora 
pelB gene. The translated leader sequence is a signal for 
secretion of the respective protein through the cytoplasmic 
membrane. The fragment ligated downstream of the leader 

20 sequence was a Smal to XhoX fragment from pING3731 which 
contains the complete gelonin gene. Thus, the expression 
vector contains the gelonin gene linked to the pelB leader 
sequence and the araB promoter. This plasmid is designated 
PING3733. 

25 A second expression vector may be constructed 

that is identical to the first except that the gelonin gene 
sequences encoding the nineteen C- terminal amino acids of 
gelonin are not included. The cONA sequence of the gelonin 
gene predicted a 19 residue C-terminal segment that was not 

30 detected in any peptide fragments generated for 
determination of the gelonin amino acid sequence. These 19 
amino acids may represent a peptide segment that is cleaved 
from the mature toxin post- trans la tionally, i.e. that is 
not present in the native protein. A similar C-terminal 

35 amino acid segment was identified in the plant toxin 

a-trichosanthin [Chow et al., *7. Biol. Chem,, 265:8670-8674 
(1990)]. Therefore, the expression product without the C- 
terminal fragment is of interest. 
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For construction of a gelonin expression vector 
without the 19 C-terminal amino acids of gelonin, PCR was 
used to amplify and alter the 3* -end of the gene. plNG3728 
was amplified with primers gelo-14 and gelo-9 (SEQ ID NO: 
5 20) . The sequence of primer gelo-14 is set out below. 

Gelo-14 (SEQ ID NO: 24) 

51 TGATCTCGAGTACTATTTAGGATCTTTATCGAgGA 3« 
Primer gelo-14, which corresponds to gelonin amino acids 
245-256, introduces a termination codon (underlined in the 

10 primer sequence) in the gelonin gene sequence which stops 
transcription of the gene before the sequences encoding the 
terminal 19 amino acids of the gelonin and also introduces 
a Xhol site immediately downstream of the termination 
codon. The PCR product was cut with Xhol and EcoKl, and 

15 the resulting 208 bp fragment encoding amino acids 185-251 
of gelonin was purified from an agarose gel. This fragment 
was ligated adjacent to the Ncol to EcoKI fragment from 
pING3823 encoding amino acids 37-185 of gelonin to generate 
plasmid pING3732. A final expression vector, plNG3734, 

20 containing a gelonin gene with an altered 3* -end was 

generated by substituting an Ncol to Xhol fragment encoding 
amino acids 37-251 of gelonin from pING3732 into pING3733. 

B. Identification Of The Native Gelonin 5 '-End 

Inverse PCR was used to identify a cDNA clone 
25 encoding the 5 9 -end of the mature gelonin gene. 5 of 
total G. multlflorum RNA was converted to cDNA using the 
Superscript Plasmid System (BRL, Gaithersburg, Maryland) 
with Gelo-11 (SEQ ID NO: 18) as a primer. Gelonin cDNA was 
self-ligated to generate covalent circular DNA and the 
30 ligated DNA was amplified by PCR with oligonucleotides 

Gelo-9 (SEQ ID NO: 20) and Gelo-16. The sequence of primer 
Gelo-16 is set out below. 

Gelo-16 (SEQ ID NO: 25) 
5 1 GTAAGCAGCATCTGGAGCATCT 3 1 
35 The PCR product was size-fractionated on an agarose gel and 
DNAs larger than 300 bp were cloned into Smal cut pUC18. 
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Several clones were sequenced with the primer Gelo-is, the 
sequence of which is set out below. 

Gelo-18 (SEQ ID NO: 26) 
5 1 1 CATTCAAG AAATT CACGTAGG 3 1 
5 A clone identified as having the largest gelonin-specif ic 
insert was designated pING3826. The DNA sequence of 
PING3826 included .the first 32 nucleotides of the natural, 
mature gelonin gene not necessarily present in gelonin 
expression plasmids pING3733 and pING3734. The complete 
10 DNA sequence of the natural gelonin gene is set out in SEQ 
ID NO: 11. 



C. construction* Of Expression Vectors 

containing A Gelonin Gene With A Natural 5' End 

Derivatives of expression vectors pING3733 and 

15 pING3734 (described above) containing a gelonin gene with 
the natural 5' sequence were generated as follows* The 5'- 
end of gelonin was amplified from pING3826 with the PCR 
primers Gelo-16 (SEQ ID NO: 24) and Gelo-17, the sequence 
of which is set out below. 

20 Gelo-17 (SEQ ID NO: 27) 

5* GGCCTGGACACCGTGAGCTTTAG 3* 
The 285 bp PCR product was treated with T4 polymerase and 
cut with tfcol. The resulting 100 bp 5 f -end DNA fragment 
was isolated from an agarose gel and ligated adjacent to 

25 the 120 bp pelB leader fragment from plClOO (cut with SstI, 
treated with T4 polymerase and cut with PstI) into either 
PING3733 or pING3734 digested with PstI and tfcol. The 
resulting plasmids pING3824 and pING3825 contain the entire 
native gelonin gene and the native gelonin gene minus the 

30 nineteen amino acid carboxyl extension, respectively, 
linked to the pelB leader and under the transcriptional 
control of the araB promoter. The gene construct without 
the nineteen amino acid carboxyl extension in both pING3734 
and pING3825 encodes a protein product referred to in this 

35 application as "recombinant gelonin*. 



WO 94/26910 



PCT/US94/05348 



-42- 

D. Purif ication O f peeom binant GelpfHr^ 

Recombinant gelonin was purified by the following 
procedure: E. coll fermentation broth was concentrated and 
buffer-exchanged to 10 mM sodium phosphate at pH 7.0 by 
5 using an S10Y10 cartridge over a DC10 unit (Amicon) the 
concentrated and buffer-exchanged material was applied to 
a CMS 2 column (100 g, 5X10 cm)* The column was washed with 
1 L of starting buffer and eluted with a 0 to 300 mM NaCl 
gradient in starting buffer (750 ml total volume) . The 

10 pure gelonin containing fractions were pooled (elution was 
from 100-250 mM NaCl) , concentrated over an Amicon YM10 
membrane , equilibrated with 10 mM sodium phosphate buffer , 
pH 7.0, and stored frozen at -20*C. A further purification 
step was attempted using Blue Toyopearl chromatography. 

15 However, this procedure did not result in an increased 
purity of material and resulted in an approximate 50% loss 
of the starting material. 

groupie ? 

Assembly Of Gelonin Genes With 
20 cysteine Residues Available For Conjugation 

The wild-type gelonin protein has two cysteine 

residues at positions 44 and 50 which are linked by an 

endogenous disulfide bond. The protein contains no free 

cysteine residue directly available for conjugation to 

25 antibodies or other proteins. Analogs of gelonin which 
contain a free cysteine residue available for conjugation 
were generated by three different approaches. In one 
approach, various residues along the primary sequence of 
the gelonin were replaced with a cysteine residue, creating 

30 a series of analogs which contain an odd number of cysteine 
residues. In another approach, one of the two endogenous 
cysteines was replaced by alanine, creating a molecule 
which lacks an intrachain disulfide bond but contains a 
single, unpaired cysteine. In yet another approach both 

35 endogenous cysteines were replaced by alanines and a third 
non-cyst eine residue was replaced by a cysteine, creating 
an analog with a single, unpaired cysteine. 
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Fifteen analogs of gelonin were constructed. Ten 
non-cysteine residues of gelonin were targeted for 
substitution with a cysteine residue, comparison of the 
amino acid sequence of gelonin to the natural amino acid 
5 sequence and tertiary structure of the ricin A-chain (see 

FIG* l) suggested that these positions would be at the 
surface of the molecule and available for conjugation. 
Each of the ten gelonin analogs include a cysteine 
substituted in place of one of the following residues: 
10 lysine l0 , asparagine 60 , isoleucine l03 , aspartic acid 144 , 

arginine 184 , serine 213 , asparagine 23 , , lysine 244 , aspartic 
acid 247 , and lysine 24S , and the analogs have respectively been 
designated Gelc 10 , Gel C60 , Gel cl03 , Gelc 146 , Gel cl84 , Gel^, 

Gel C23»f Gd lc24*/ Gel C247> and Ge lc2*a • 

15 Two analogs of gelonin were constructed in which 

one of the native gelonin cysteines that participates in an 
endogenous disulfide bond was replaced with a non -cysteine 
residue. Specifically, the cysteine at position 50 was 
replaced with an alanine residue, creating a gelonin analog 

20 (designated Gel^o^), shown in SEQ ID NO: 99) which has a 

cysteine available for disulfide bonding at position 44. 
The Gel^o (c 4 4) analog has been referred to previously as Gelg 44 
(see, e.g., co-owned, co-pending U.S. Patent Application 
Serial No. 07/988,430, incorporated by reference herein). 

25 Conversely, the cysteine at position 44 was replaced with 
an alanine residue, resulting in an analog (designated 
G el*44(C30)# shown in SEQ ID NO: 100) which has a cysteine 
available for disulfide bonding at position 50. The 
Gel A44(C50) analog has been referred to previously as Gel^o 

30 (see, e.g., co-owned, co-pending U.S. Patent Application 

Serial No. 07/988,430, incorporated by reference herein). 
The combined series of the foregoing twelve analogs thus 
spans the entire length of the mature gelonin protein. 

Another gelonin analog (Gel A44A5 o SEQ ID NO: 101) 

35 was constructed in which both native gelonin cysteines were 
replaced with alanines. The Gel A44A30 analog has been 
referred to previously as Gel C44AC50 A (see, e.g., co-owned, co- 
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pending U.S. Patent Application Serial No. 07/988 , 430, 
incorporated by. reference herein) . Two additional analogs 
were constructed which have alanine residues substituted in 
place of both native cysteines and have either a cysteine 
5 residue substituted in place of the native lysine at 

position 10 (Gel cl0A44A3fl , shown in SEQ ID NO: 110) or a 
cysteine residue* substituted in place of the native 
aspartate at position 247 (Gel C24 7A4*Asof shown in SEQ ID NO: 
111) . 

10 The variants of. recombinant gelonin were 

constructed by restriction fragment manipulation or by 
overlap extension PCR with synthetic oligonucleotides. The 
sequences of the primers used for PCR are set out below. 
In each mutagenic primer sequence, the nucleotides 
15 corresponding to the changed amino acid, either a cysteine 
or an alanine residue, are underlined. 

Gelo-9 (SEQ ID NO: 20) 
Gela-11 (SEQ ID NO: 18) 
Gelo-16 (SEQ ID NO: 25) 
20 Gelo-17 (SEQ ID NO: 27) 

Gelo-18 . (SEQ ID NO: 26) 

Gelo-19 (SEQ ID NO: 58) 

5 # CAGCCATGGAATCCCATTGCTG 3' 

GeloC-1 (SEQ ID NO: 28) 
25 5 • TCGATTGCGATCCTAAATAGTACTC 3 1 

GeloC-2 (SEQ ID NO: 29) 

5 » TTTAGGAT CGCAA TCGACGAACTTCAAG 3* 

GeloC-3-2 (SEQ ID NO: 30) 

5» GTTCGTC TGTA AAGAT C CTAAATAGTACT CGA 3 • 



30 



GeloC-4 (SEQ ID NO: 31) 

5* GGATCTTTACAGACGAACTTCAAGAGT 3' 
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GeloC-5 (SEQ ID NO: 32) 

5 • TCTTGT££TTCGTCGATAAAGATCC 3 1 

GeloC-6 (SEQ ID NO: 33) 

5 • ATCGACGAASS&CAAGAGTGCTATTTT 3 1 

GeloO-9 (SEQ ID NO: 34) 

5 ' GTAAAACC ATGCA TAGCACTCTTGAAGTTCGT 3 1 

GeloC-10 (SEQ ID NO: 35) 

5 1 AGTGCTA TGCA TGGTTTTACTTGATCAACTGC 3' 

GeloO-13 (SEQ ID NO: 36) 

5 1 AGCACATSIGGTGCCACTTATATTACCTA 3 r 

GeloC-14 (SEQ ID NO: 37) 

5 1 TAAGTGGCACCaSATGTGCTAAAGCTCACGGTG 3 f 

GeloC-15 (SEQ ID NO: 38) 

5' TGAC TGT GGACAGTTGGCGGAAATA 3* 

GeloC-16 (SEQ ID NO: 39) 

5 • GCCAACTGTCCACAGTCATTTGAAAGCGCTACC 3' 
GeloC-17 (SEQ ID NO: 40) 

5 f GATGATCCTGGAAA GGCTT TCGTTTTGGTAGCGCTT3 ' 

GeloC-18 (SEQ ID NO: 41) 
5 1 AAGC CTTTCCAGGATCATCAGC 
TTTTTTGCGCAGCAATGGG 3 ■ 

GeloC-19 (SEQ ID NO: 42) 

5 1 A AGC CTTTCCAGGATCATCACAT 3 ' 

GeloC-20 (SEQ ID NO: 59) 

5 f CAC ATGTA AAACAAGACTTCATTTTGGC 3 1 
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GeloC-21 (SEQ ID NO: 60) 

5 1 TGAAGTCTTGTTTT^^TGTGTTTTTGAAG AGGC CT 3 f 

GeloC-22 (SEQ ID NO: 61) 

5 • ATGCCATAIS£AATTATAAACCAACGGAGA 3 • 

5 GeloC-23 (SEQ ID NO: 62) 

5 • GGTTTATAATT GCA TATGG 
CATTTTCATCAAGTTTCTTG 3 1 

GeloC-24 (SEQ ID NOt 63) 

5 1 CTTTCAACAAIS£ATTCGCCCGGCGAATAATAC 3 1 

10 GeloC-25 (SEQ ID NO: 64) 

5 • GCGAATSS&TTGTTGAAAGTTATTTCTAATTTG 3 * 

GeloC-26 (SEQ ID NO: 65) 

5 1 GTTT TGT GAGGCAGTTGAATTGGAAC 3 • 

GeloC-27 (SEQ ID NO: 66) 
15 5' TTCAACTGCCT CACAA AACATTCCATTTGCACCT 3 1 

GeloC-28 (SEQ ID NO: 67) 

5 1 AAAASdGATGATCCTGGAAAGTG 3* 

GeloC-29 (SEQ ID NO: 68) 

5» TCCAGGATCATC&££TTTTTTGCGCAGCAATGGGA 3 • 

20 araB2 (SEQ ID NO: 43) 

5' GCGACTCTCTACTGTTTC 3 f 

HINDX1Z-2 (SEQ ID NO: 44) 
5' CGTTAGCAATTTAACTGTGAT 3 ■ 



25 



(1) Specifically, a cysteine was introduced at 
amino acid 247 of gelonin (which is normally occupied by an 
aspartic acid which corresponds to the cysteine at position 



WO 94/26910 



PCT/US94/05348 



10 



•47- 

259 in the ricin A-chain) by pcr with mutagenic primers 
GeloC-3-2 and GeloC-4 in conjunction with primers HINDXXX-2 
(a primer located in the vector portion of plNG3734 or 
pING3825), Gelo-9 and GelO-8. Template DNA (pING3734) was 
amplified with GeloC-3-2 and HINDXXX-2 and in a concurrent 
reaction with GeloC-4 and Gelo-9. The products of these 
reactions were mixed and amplified with the outside primers 
Gelo-8 and HINDXXX-2. The reaction product was cut with 
EcoRX and Xhol, purified, and was inserted into plasmid 
PING3825 in a three-piece ligation* The DNA sequence of 
the Gelc 247 variant (SEQ ID NO: 102) was then verified. The 
plasmid containing the sequence encoding Gelc 247 was 
designated pING3737 and was deposited with the American 
Type culture Collection, 12301 Parklawn Drive, Rocfcville, 
15 MD 20852 on June 9, 1992 as ATCC Accession No. 69009. 

(2-3) In the same manner, a cysteine residue was 
introduced in place of the amino acid at position 248 (a 
lysine) of gelonin with the mutagenic oligonucleotides 
GeloC-1 and GeloC-2 to generate analog Gel^e (SEQ id no: 
20 103) in plasmid pING3741, and a cysteine residue was 
introduced at amino acid position 239 (normally occupied by 
a lysine) with primers GeloC-9 and GeloC-10 to generate 
analog Gel 23 , (SEQ ID NO: 104) in plasmid pING3744. 

(4) Also in the same manner, a cysteine residue 
25 was introduced at amino acid 244 (a lysine) of gelonin with 

mutagenic primers GeloC-5 and GeloC-6 to generate analog 
G *lc244 (SEQ ID NO: 105) in a plasmid designated pING3736. 
This variant was prepared by PCR using plasmid plNG3734 as 
template DNA rather than pING3825. It therefore encodes 
30 the same N- terminal gelonin amino acid sequence as plasmids 
PING3737, pING3741, and pING3744, but includes the PCR 
primer-derived 5 f -end 32 nucleotides instead of the native 
gelonin 5 '-end nucleotides. 

(5) A cysteine residue was introduced in place of 
35 the amino acid (normally occupied by a lysine) at position 

10 of gelonin by a similar procedure. A cysteine was 
introduced with mutagenic primers GeloC-13 and GeloC-14 by 
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amplifying plNG3824 with araB2 (a vector primer) and GeloC- 
14, and in a separate reaction, with GeloC-13 and Gelo-11. 
These reaction products were mixed and amplified with the 
outside primers araB2 and Gelo-11. The PCR product was cut 
5 with PstI and tfcol, purified, and cloned back into pING3825 

to generate analog Gel ci0 (SEQ ID NO: 106) in the plasmid 
designated pING3746 and deposited with the American Type 
Culture Collection, 12301 Parklawn Drive, RocJcville, MD 
20852 on June 9, 1992 as ATCC Accession No. 69008. 

10 (6) The asparagine at position 60 of gelonin was 

replaced with a cysteine residue using two mutagenic 
oligos, GeloC-15 and GeloC-16, in conjunction with oligos 
araB2 and Gelo-11 in the same manner as for the Gel cl0 
variant. The plasmid encoding the Gelc 60 (SEQ ID NO: 107) 

15 analog was designated pING3749. 

(7) A cysteine was introduced at amino acid 103 
(an isoleucine) by PCR with mutagenic primers GeloC-20 and 
GeloC-21 in conjunction with primers araB2 and HJNDIII-2. 
Template DNA (pING3733) was amplified with GeloC-21 and 

20 araB2 and separately with GeloC-20 and HJNDIII-2. The 
products of these reactions were mixed and amplified with 
the outside primers araB2 and HJOTIII-2. The reaction 
product was cut with Ncol and Bell, purified, and inserted 
into pING3825 digested with tfcol and Bell. The 

25 oligonucleotides used to place a cysteine at residue 103 

also introduced an Afllll restriction site which was 
verified in the cloned gene. The plasmid containing the 
G*lcio3 (SEQ ID NO: 108) analog was designated pING3760. 

(8) A cysteine was introduced at position 146 
30 (an aspartic acid) by a similar strategy. Template DNA 

(pING3733) was amplified with mutagenic primer GeloC-22 and 
Gelo-14 and separately with mutagenic primer GeloC-23 and 
Gelo-19. The products of these reactions were mixed, and 
amplified with Gelo-19 and Gelo-14. The reaction product 
35 was cut with Bgl II and EcoRl, and can be inserted into 
pING3825 in a three-piece ligation. The oligonucleotides 
used to place a cysteine at residue 146 also introduced a 
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Ndel restriction site which can be verified in the cloned 
gene. 

(9) To introduce a cysteine at position 184 
(normally occupied by an arginine) of gelonin, template DNA 

5 (pING3733) was amplified with mutagenic primer Geloc-25 and 

araB-2 and separately with mutagenic primer GeloC-24 and 
HJtfDIIX-2. The products of these reactions were mixed , and 
amplified with araB2 and Gelo-14. The reaction product was 
cut with tfcol and Bell, and inserted into pING3825 

10 previously digested with Ncol and Bell. The 
oligonucleotides used to place a cysteine at residue 184 
also introduced an Nsil restriction site which was verified 
in the cloned gene. The plasmid containing the sequence 
encoding the Gel^,* (SEQ ID NO: 109) variant was designated 

15 pING3761. 

(10) A cysteine may be introduced at position 
215 (a serine) by a similar strategy. Template DNA 
(pING3733) was amplified with mutagenic primer GeloC-27 and 
araB2 and separately with mutagenic primer GeloC-26 and 

20 BTNDIII-2. The products of these reactions were mixed , and 
amplified with araB2 and ffJMDXXX-2. The reaction product 
was cut with EcoRX and BclX, and may be inserted into 
pING3825 in a three-piece ligation. 

(11) Another gelonin variant with a free cysteine 
25 residue was generated by replacing one of the two naturally 

occurring gelonin cysteine residues, the cysteine a 
position 50, with an alanine. Plasmid pING3824 was 
amplified with primers GeloC-17 and Gelo-ll f and 
concurrently in a separate reaction with primers GeloC-19 

30 and araB2. The reaction products were mixed and amplified 
with araB2 and Gelo-11. This product was cut with tfcol and 
BgrllX, and cloned back into the vector portion of pING3825 
to generate pING3747 (ATCC 69101) . This analog was 
designated Gel^cc^) it contains a cysteine available for 

35 disulfide bonding at amino acid position 44. Non-cysteine 
residues, other than alanine, which do not disrupt the 
activity of gelonin, also may be inserted at position 50 in 
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natural gelonin in order to generate a gelonin analog with 
a single cysteine at position 44. 

(12) A gelonin variant in which the natural 
cysteine at position 44 was changed to alanine was 

5 constructed by amplifying pING3733 using the mutagenic 

oligonucleotides GeloC-28 and GeloC-29 in conjunction with 
■ primers ar>aB2 and HJtfDIII-2. The amplified DNA was cut 
with Ncol and Bgrlll and cloned into a gelonin vector, 
generating pING3756. That variant generated was designated 
10 Gel A Mcc3o>* Non-cysteine residues, other than alanine, which 
do not disrupt gelonin activity, also may be inserted at 
position 44 in order to generate a gelonin analog with a 
single cysteine at position 50. 

(13) A gelonin variant in which both the 
15 cysteine at position 44 and the cysteine at position 50 of 

gelonin were changed to alanine residues was constructed by 
overlap PCR of pING3824 using the mutagenic 
oligonucleotides GeloC-17 and GeloC-18 in conjunction with 
primers araB2 and Gelo-11. This analog, like the native 

20 gelonin protein, has no cysteine residues available for 
conjugation. The plasmid encoding the analog was 
designated pING3750. The analog generated was designated 
Gel A * 4A50 (SEQ ID NO: 101). Non-cysteine residues, other than 
alanine, which do not disrupt gelonin activity, also may be 

25 substituted at both positions 44 and 50 in order to 

generate a gelonin analog with no cysteine residues. 

(14) The triple mutant Gelonin^A^xjo (SEQ ID NO: 
111) was constructed from the plasmids pING3824, pING3750 
and pING3737. This variant contains an introduced cysteine 

30 at position 247 while both of the naturally occurring 

cysteine residues at positions 44 and 50 have been replaced 
with alanine. The analog is desirable because, in this 
analog, disulfide linkage to an antibody is only assured at 
a single cysteine residue. Plasmid pING3824 was cut with 

35 Ncol and Xhol and the vector fragment was purified in an 
agarose gel. pING3750 was cut with Ncol and EcoRI and 
PING3737 was cut with ScoRI and Xhol* The tfcoI-ScoRI 
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fragment encodes the alanines at positions 44 and 50 while 
the EcoRl-Xhol fragment encodes the cysteine at position 
247. Each of these fragments was purified and ligated to 
the Ncol to Xhol vector fragment. The resulting plasmid is 
5 named pING3752. 

(15) The triple mutant Gelon^^*^ (SEQ ID NO: 
110) was also constructed by assembly from previously 
assembled pi asm ids. In this case, pING3746 was cut with 
PstI and tfcol, while pING3750 was cut with Ncol and Xhol. 

10 Each of the insert fragments were purified by 

electrophoresis in an agarose gel, and the fragments were 
ligated into a PstI and Xhol digested vector fragment. The 
resulting vector was designated pING3753. The Gel clOA44X50 
analog has been referred to previously as Gelc 1Q c4McsaA (see, 

15 e.g., co-owned, co-pending U.S. Patent Application Serial 

No. 07/988,430, incorporated by reference herein). 

Each of the gelonin variants constructed was 
transformed into E. coli strain E104. Upon induction of 
bacterial cultures with arabinose, gelonin polypeptide 

20 could be detected in the culture supernatants with gelonin- 

specific antibodies. There were no significant differences 
detected in the expression levels of gelonin from plasmids 
PING3734 and pING3825, or in the levels from any of the 
gelonin variants. Each protein was produced in E. coli at 

25 levels of approximately 1 g/1. 



Reticulocyte Lvsate Assay 

The ability of gelonin and recombinant gelonin 
analogs to inhibit protein synthesis in vitro was tested 

30 using a reticulocyte lysate assay (RLA) described in Press 
et al., Immunol. Letters, 14:37-41 (1986). The assay 
measures the inhibition of protein synthesis in a cell-free 
system using endogenous globin mRNA from a rabbit red blood 
cell lysate. Decreased incorporation of tritiated leucine 

35 (3H-Leu) was measured as a function of toxin concentration. 

Serial log dilutions of standard toxin (the 30 kD form of 
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ricin A-chain, abbreviated as RTA 30), native gelonin, 
recombinant gelonin (rGelonin or rGel) and gelonin analogs 
were tested over a range of 1 Mg/ml to 1 pg/ml. Samples 
were tested in triplicate, prepared on ice, incubated for 
3 0 minutes at 37 *c, and then counted on an Inotec Trace 96 
cascade ionization counter. By comparison with an 
uninhibited sample, the picomolar concentration of toxin 
(pM) which corresponds to 50% inhibition of protein 
synthesis (IC 30 ) was calculated. As is shown in Table l 
below, recombinant gelonin and most of its analogs exhibit 
activity in the RLA comparable to that of native gelonin. 
For some of the analogs (such as Gel C239 ) , RLA activity was 
diminished. 

Table 1 
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Bxample s 

Human -Engineered Antibodies 

For Construction Of Imaunotoxins 

Antibodies for use in constructing immunotoxins 

5 according to the present invention may be humanized 

antibodies , such as he3 and fragments thereof which display 

increased content of human amino acids and a high affinity 

for human CD5 cell differentiation marker. he3 is a 

humanized form of a mouse H65 antibody (H65 is a preferred 

10 monoclonal antibody for use in preparing humanized 
antibodies according to the present invention and is 
produced by hybridoma cell line XMMLY-H65 (H65) deposited 
with the American Type Culture Collection in Rockville, 
Maryland (A.T.C.C.) and given the Accession No* HB9286) . 

15 Humanized antibodies for use in the present 

invention are prepared as disclosed herein using the 
humanized forms of the murine H65 antibody in which both 
low and moderate risk changes described below were made in 
both variable regions. Such humanized antibodies should 

20 have less immunogenic ity and have therapeutic utility in 
the treatment of autoimmune diseases in humans. For 
example, because of their increased affinity over existing 
therapeutic monoclonal antibodies such as H65, he3 
antibodies of the invention may be administered in lower 

25 doses than H65 ant i -CD 5 antibodies in order to obtain the 

same therapeutic effect. 

Humanized antibodies, such as he3, are useful in 
reducing the immunogenic ity of foreign antibodies and also 
results in increased potency when used as a portion of an 

3 0 immunocon jugate . 

Construction of humanized antibody variable 
domains according to the present invention and for use as 
components of immunotoxins may be based on a method which 
includes the steps of: (1) identification of the amino acid 

35 residues of an antibody variable domain which may be 

modified without diminishing the native affinity of the 
domain for antigen while reducing its immunogenicity with 
respect to a heterologous species; and (2) the preparation 
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of antibody variable domains having modifications at the 
identified residues which are useful for administration to 
5 heterologous species. The methods of the invention are 
based on a model of the antibody variable domain described 
herein and in U.S. patent application no. 07/808,464 by 
Studnicka, et al., which predicts the involvement of each 
amino acid in the structure of the domain* 

10 Unlike other methods for humanization of 

antibodies, which advocate replacement of the entire 
classical antibody framework regions with those from a 
human antibody, the methods described herein and in U.S. 
patent application no. 07/808,464 by Studnicka, et al., now 

15 abandoned, introduce human residues into the variable 
domain of an antibody only in positions which are not 
critical for antigen -binding activity and which are likely 
to be exposed to immunogenic! ty- stimulating factors. The 
present methods are designed to retain sufficient natural 

20 internal structure of the variable domain so that the 
antigen-binding capacity of the modified domain is not 
diminished in comparison to the natural domain. 

The human consensus sequences in which moderate 
risk residues are converted from mouse residues to human 

25 residues are represented in Figures 10A and 10B as lines 
labelled hKl (i.e., subgroup 1 of the human kappa chain) 
and hH3 (i.e., subgroup 3 of the human heavy chain). 
Symbols in the figures for conservation and for risk in 
"bind" and "bury" lines are as follows: 

30 

First Symbol in Pair (Ligand Binding) 
+ Little or not direct influence on antigen- 
binding loops, low risk if substituted 
° Indirectly involved in antigen-binding loop 
35 structure, moderate risk if changed 

Directly involved in antigen-binding loop 
conformation or antigen contact, great risk 
if modified 
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Second Symbol in Pair ( Immunogenicity/ Struture) 
+ Highly accessible to solvent, high 

immunogenicity, low risk if substituted 
° Partially buried, moderate immunogenicity, 
5 moderate risk if altered 

- Completely buried in subunit's hydrophobic 

core, low immunogenicity, high risk if 

changed 

= Completely buried in interface between 
10 subunits, low immunogenicity, high risk if 

modified 



Significance of Pairs 
++ Low risk 

Highly accessible to solvent 
15 and high immunogenicity, but 

little or no effect on specific 
antigen binding 
°+# +•# *° Moderate Risk 

Slight immunogenicity or indirect 
20 involvment with antigen binding 

any - or ~ High risk 

Buried within the subunit core/ 
interface or strongly involved in 
antigen binding, but little immunogenic 
25 potential 

In the line labelled "mod", a dot (.) represents 
a residue which may be mutated from "mouse" to "human" at 
moderate risk. There are 29 such moderate risk positions* 

The mouse residue matches the human consensus 
30 residue more than 50% of the time at 131 positions (102 
positions match 90%-10Q% and 29 positions match 50% to 
90%) . These positions were not changed. 

The lines labelled M/H in Figures 12A and 12B 
indicate the 91 positions which differed significantly 
3 5 between the mouse and human sequences (i.e., where the 
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human sequences have the mouse residue less than 50% of the 
time). Moderate risk positions, designated m in the M/H 
line, were kept "mouse"; whereas those designated H or h 
were changed to human. The 25 low risk positions which 
were already human-like or which were previously humanized 
(as described supra in Example 2) are designated " A *• in 
the M/H line. Finally, the 54 high risk positions in which 
the mouse and human residues did not match are designated 
M and are kept "mouse ". 

Fifteen differences occur at moderate risk 
positions at which the mouse and human sequences differ. 
At ten of those positions (designated "H" on the M/H lines 
of Figure 6) the mouse residue aligns with a human 
consensus amino acid which is highly conserved. Therefore, 
the mouse residue at that position is identified as one to 
be changed to the conserved human residue. 

At moderate risk positions (designated "m") in 
which the mouse and the human sequences differ, the mouse 
residue aligns with a human consensus amino acid which is 
moderately conserved. However, since the mouse residue is 
found at that position in other actual sequences of human 
antibodies [See Kabat, et al., sequences of Proteins of 
Immunoglobulin Interest, Fourth Edition, U.S. Department of 
Health and Human Services, Public Health Service, National 
Institutes of Health (1987)] the positions are identified 
as ones to be kept "mouse." Although there are no such 
positions in this particular sequence, such positions may 
occur in other antibodies. 

At four moderate risk positions (designated "h") , 
the mouse residue aligns with a human consensus amino acid 
which is moderately conserved but the mouse residue is not 
found at that position in an actual human antibody sequence 
in Kabat, et al. Sequences of Proteins of Immunoglobulin 
Interest, supra. Therefore, that position is identified as 
ones to be changed to "human. " 

At one moderate risk position (designated "m") in 
which the mouse and human sequences differ, the mouse 
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residue aligns with a human consensus amino acid which i 
poorly conserved. Therefore, that position is identified 
as one to be kept "mouse." 

A. Assembly Of Moderate Risk 

Heavy Chain Expression Vectors 

The humanized H65 heavy chain containing the 

moderate risk residues was assembled by the following 

strategy. The moderate-risk expression vector was 

assembled from intermediate vectors. The six 

oligonucleotide sequences (oligos) , disclosed in Figure 12 

and labelled HUH-G11, HUH-G12 , HUH-G3, HUH-G4, HUH-G5, and 

HUH-G6 (the sequences of HUH-G11 and HUH-G12 are set out in 

SEQ ID Nos. 131 and 132 and HUH-G3, HUH-G4, HUH-G5, and 

HUH-G6 are set out in SEQ ID NOS: 137-140) were assembled 

by pcr. Oligonucleotides containing the synthetic 

humanized antibody gene were mixed in pairs (HUH-Gli + 

HUH-G12, HUH-G3 + HUH-G4 , and HUH-G5 + HUH-G6) in a 100 m! 

reaction with 1 of each DNA and filled in as described 

above. A portion of each reaction product was mixed in 

pairs (HUH-G11, 12 + HUH-G3, 4; HUH-G3, 4 + HUH-G5, 6) , 2.5 

U Taq was added and samples were reincubated as described 

above. The V-J region was assembled by mixing equal 

amounts of the HUH-Gll, 12 , 3, 4 reaction product with the 

HUH-G3, 4, 5, 6 product, followed by PCR with 0.5 ug of 

primers H65G-2S and H65-G2 as described above. The 

reaction product was cut with Sail and SstEII and cloned 

into the expression vector, similar to that described for 

heavy chain in Robinson et a!., Htm. Antibod. Hybridomas 

2:84 (1991) , generating pING4617. That plasmid was 

sequenced with Sequenase (USB f Cleveland) , revealing that 

two residues were altered (a G-A at position 288 and a a-T 

at position 312, numbered from the beginning of the leader 

sequence) . The correct variable region was restored by 

substitution of this region from pING4612, generating the 

expected V-region sequence in pING4619. 

An intermediate vector containing the other 

moderate-risk changes was constructed by PCR assembly of 
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the oligos HUH-G13, HUH-G14, HUH-G15, and HUH-G16 (Fig* 11 
and SEQ ID Nos: 13 3-13 6) . Oligos HUH-G13 + HUH-G14 and 
HUH-G15 +■ HUH-G16 were mixed* and filled in with Vent 
polymerase (New England Biotabs) in a reaction containing 
5 10 mM KCl f 20 mM TRIS pH 8,8, 10 mM (NH<) 2 SO z , 2mM MgSO*, 

0-1% Triton X-100, 100 ng/ml BSA, 200 uM of each dNTP, and 
2 units of Vent polymerase in a total volume of 100 /ii. 
The reaction mix was incubated at 94 °c for l minute, 
followed by 2 minutes at 50 °C and 20 minutes at 72 °C. The 

10 reaction products (40 Ml) were mixed and amplified with the 
oligonucleotides H65-G13 and H65-G2 with Vent polymerase in 
the same reaction buffer and amplified for 25 cycles with 
denaturation at 94 °C for 1 minute, annealing at 50°c for 2 
minutes and polymerization at 72 °C for 3 minutes. The 

15 reaction product was treated with T4 polymerase and then 

digested with Accl. The 274 base pair (bp) fragment was 
purified on an agarose gel and ligated along with the 141 
bp Sail to AccI fragment from pING4619 into pUC18 cut with 
Sail and Smal to generate pING4620. plNG4620 contains the 

20 entire signal sequence, V-region, and J-region of the 
moderate-risk H65 heavy chain. 

The final expression vector for the moderate-risk 
H65 heavy chain , pING4621, was assembled by cloning the 
Sail to BstEXl fragment from pING4620 into the same 

25 expression vector described above. 

B. Assembly Of Moderate-Risk 

Light Chain Expression Vectors 

The moderate-risk humanized V- and J-segments of 

the light chain were assembled from six oligonucleotides, 

30 $H65K-1 (SEQ ID NO: 117), HUH-K7 (SEQ ID NO: 119), HUH-K6 

(SEQ ID NO: 118), HUH— KB (SEQ ID NO: 120), HUH-K4 (SEQ ID 
NO: 121 and HUH-K5 (SEQ ID NO: 122). The oligonucleotides 
were amplified with PCR primers H65K-2S and JKl-Hindlll. 
Oligonucleotides containing the synthetic humanized 

35 antibody gene were mixed in pairs ($H65-K1 + HUH-K7, HUH-K6 

+ HUH-K4 + HUH-K5) and incubated with Vent polymerase as 
described for the moderate-risk heavy chain. A portion of 
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each reaction product (40 ul) was nixed in pairs 
<$H65H-K1/HUH-K7 + HUH-K6 , 8; HUH-K6 , 8 + HUH-K4 , 5) and 
filled in as above. The light chain gene was then 
assembled by amplifying the full length gene with the PCR 
5 primers H65K-2S and JKl-ffindlll with Vent polymerase for 25 
cycles as outlined above. The assembled V/J region was cut 
with Sail and Hindlll, purified by electrophoresis on an 
agarose gel, and assembled into a light chain antibody 
expression vector, pING4630. 

10 Example 6 

Transfection Of he3 Genes And 
Purification Of Expression Products 

A. Stable Transfection Of Mouse Lymphoid 

Cells For The Production Of he3 Antib^y 

15 The cell line Sp2/0 (American Type Culture 

Collection Accession No. CRL1581) was grown in Dulbecco's 
Modified Eagle Medium plus 4.5 g/1 glucose (DMEM, Gibco) 
plus 10* fetal bovine serum. Media were supplemented with 
glutamine/penicillin/streptomycin (Irvine Scientific, 

20 Irvine, California) . 

The electroporation method of Potter, H. , et al . , 
Proc. Natl. Acad. Sci., USA, 82:7161 (1984) was used. 
After transfection, cells were allowed to recover in 
complete DMEM for 24-48 hours, and then seeded at 10,000 to 

25 50,000 cells per well in 96-well culture plates in the 

presence of selective medium. Histidinol (Sigma) selection 
was at 1.71 Mg/ml, and mycophenolic acid (Calbiochem) was 
at 6 /Ltg/ml plus 0.25 mg/ml xanthine (Sigma). The 
electroporation technique gave a transfection frequency of 

30 l-io x 10~ 3 for the Sp2/0 cells. 

The he3 light chain expression plasmid pING463 0 
was linearized by digestion with Pvul restriction 
endonuclease and transfected into Sp2/0 cells, giving 
mycophenolic acid - resistant clones which were screened 

35 for light chain synthesis. 

Four of the top-producing subclones, secreting 
4.9-7.5 Mg/ml were combined into two pools (2 clones/pool) 
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and each pool was transfected with plasmid pING4262l, 
containing the moderate-risk heavy chain. After selection 
with histidinol, the clones producing the most light plus 
heavy chain, Sp2/0-4630 and 4621 Clones C1705 and C1718, 
5 secreted antibody at approximately 15 and 22 Mg/ul 

respectively in the presence of 10~ 7 M dexamethasone in an 
overgrown culture, in a T25 flask* Clone C171S was 
deposited with the American Type Culture Collection, 1230 
Parklawn Drive, Rockville, Maryland, 20852 on December l f 
10 1992 as ATCC HB 11206. The best producer is a subclone of 

Clone C1718 which is produced by limiting dilution 
subcloning of Clone C1718. 

B. Purification Of he3 Antibody 
Secreted In Tissue Culture 

!5 Sp2/0-4630 + 4621 Clone C1705cells were grown in 

culture medium HB101 (Hana Biologies) + 1% Fetal Bovine 
Serum, supplemented with 10 mM HEPES , ix Glutamine-Pen- 
Strep (Irvine Scientific #9316) . The spent medium was 
centrifuged at about 5,000 x g for 20 minutes. The 

20 antibody level was measured by ELISA. Approximately 200 ml 

of cell culture supernatant was loaded onto a 2 ml Protein 
A-column (Sigma Chemicals) , equilibrated with PBS (buffer 
0.15 M NaCl, 5 mM sodium phosphate, 1 mM potassium 
phosphate, buffer pH 7.2). The he3 antibody was eluted 

25 with a step pH gradient (pH 5.5, 4.5 and 2.5). A fraction 
containing he3 antibody (9* yield) but not bovine antibody, 
was neutralized with 1 M Tris pH 8.5, and then concentrated 
10-fold by Centricon 30 (Amicon) diluted 10-fold with PBS, 
reconcentrated 10-fold by Centricon 30, diluted 10-fold 

30 with PBS, and finally reconcentrated 10-fold. The antibody 
was stored in 0.25 ml aliguots at -20° C. 

C. A^injty Meagqrgftsntg Of ftg? Iqg f<?r CP? 

The affinity of he3 IgG for CD5 was determined 
using Molt-4M cells, which express CDS on their surface, 
35 and I 123 -labeled chimeric H65 IgG in a competitive binding 
assay. Culture supernatants from Clone C1705 and C1718 and 
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purified IgG from C1705 were used as the sources of he3 
IgG. 

For this assay, 20 M9 of chimeric H65 IgG (cH65 
IgG) was iodinated by exposure to 100 
5 lactoperoxidase-glucose oxidase immobilized beads 

(Enzymobeads, BioRad) , 100 Mi of PBS, l.o mci I 125 (Amersham, 
IMS30) # 50 Ml of 55 mM b-D-glucose for 45 minutes at 23°C. 
The reaction was quenched by the addition of 20 ^1 of 105 
mM sodium metabisulf ite and 120 mM potassium iodine 

10 followed by centrifugation for 1 minute to pellet the 
beads. 125 I-cH65 IgG was purified by gel filtration using 7 
mis of sephadex G25, using PBS (137 mM NaCl, 1*47 mM KH 2 P0 4/ 
8.1 mM Na 2 HP0 4 , 2.68 mM KC1 at pH 7.2-7.4) plus 0.1* BSA. 
l2S I-cH65 IgG recovery and specific activity were determined 

15 by TCA precipitation. 

Competitive binding was performed as follows: 
100 Ml of Molt-4M cells were washed two times in ice-cold 
DHB binding buffer (Dubeilco's modified Eagle's medium 
(Gibco, 320-1965FJ) , 1.0% BSA and 10 mM Hepes at pH 7.2. 

20 -7.4). Cells were resuspended in the same buffer, plated 
into 96 v-bottomed wells (Costar) at 3 x 10* cells per well 
and pelleted at 4°C by centrifugation for 5 min at 1,000 
rpm using a Beckman JS 4.2 rotor; 50 Ml of 2 X- concentrated 
0.1 nM ^I-cHSS IgG in DHB was then added to each well and 

25 competed with 50 Ml of 2X - concentrated cH65 IgG or 
humanized antibody in DHB at final antibody concentrations 
from 100 nM to 0.0017 nM. Humanized antibody was obtained 
from culture supematants of Sp2/0 clone C1718 which 
expresses he3 IgG. The concentration of the antibody in 

30 the supematants was established by ELISA using a chimeric 
antibody as a standard. The concentration of the antibody 
in the purified preparation was determined by binding was 
allowed to proceed at 4°C for 5 hrs and was terminated by 
washing cells three times with 200 Ml of DHB binding buffer 

35 by centrifugation for 5 min at 1,000 rpm. All buffers and 
operations were at 4«C. Radioactivity was determined by 
solubilizing cells in 100 m! of 1.0 M NaOH and counting in 
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a Cobra II auto gamma counter (Packard) ♦ Data from binding 
experiments were analyzed by the weighted nonlinear least 
squares curve fitting program, MacLigand, a Macintosh 
version of the computer program "Ligand" from Kunson, 
5 Analyt* Biochem., 107:220 (1980). Objective statistical 

criteria (F, test, extra sum squares principle) were used 
to evaluate goodness of fit and for discriminating between 
models. Nonspecific binding was treated as a parameter 
subject to error and was fitted simultaneously with other 
1 0 parameters . 

Data showing relative 
binding of he 3 and CH65 to CDS on molt-4M cells in a 
competition binding assay demonstrate that 
the moderate-risk changes made in he3 IgG result in an 
15 antibody with a higher affinity than the chimeric mouse- 
human form of this antibody (cH65) for its target, CDS. 

Example 7 

Preparation of Gelonin Immunoconiuaates 

Gelonin analogs of the invention were variously 

20 conjugated to murine (ATCC HB9286) and chimeric H65 (cH65) 
antibody, cH65 antibody domains (including cFab, cFab* and 
cF(ab , ) 2 fragments), and humanized antibodies and antibody 
domains, all of which are specifically reactive with the 
human T cell determinant CDS. H65 antibody was prepared 

25 and purified by methods described in U.S. Patent 
Application Serial No. 07/306,433, supra and International 
Publication No. WO 89/06968, supra. Chimeric H65 antibody 
was prepared according to methods similar to those 
described in Robinson et al«, Human Antibodies and 

30 Hybridomas, 2:84-93 (1991), incorporated by reference 
herein. Chimeric H65 Fab, Fab', and F(ab')i proteins were 
prepared as described in Better, et al., Proc. Nat. Acad. 
Sci. (USA), 90: 457-461 (1993), incorporated by reference 
herein. Finally, he3 humanized antibodies were prepared 

35 according to the procedures described in U.S. Patent 
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Application Serial No* 07/808,464, incorporated by 
reference herein. 

A. Cgnjuqfltign T9 H$5 Antifrodieg 

To expose a reactive sulfhydryl, the unpaired 
5 cysteine residues of the gelonin analogs were first reduced 

by incubation with 0.1 to 2 oH DTT (30-60 minutes at room 
temperature) , and then were desalted by size-exclusion 
chromatography • 

Specifically, the Gel C24a analog (3*8 mg/ml) was 

10 treated with 2 mM DTT for 60 minutes in 0.1 M Naphosphate f 
0.25 M NaCl, pH 7.5 buffer. The Gel ca4 4 variant (7,6 mg/ml) 
was treated with 2 mM DTT for 30 minutes in 0.1 M 
Naphosphate, 0.25 H NaCl, pH 7.5 buffer. The Gel C2 4 7 analog 
(4 mg/ml) was treated with 2 mM DTT for 30 minutes in 0.1 

15 M Naphosphate, 0.5 M NaCl, pH 7.5 buffer with 0.5 mM EDTA. 

The Gel C239 variant (3.2 mg/ml) was treated with 2 mM DTT for 
30 minutes in 0.1 i Naphosphate, 0.5 M NaCl, pH 7.5 buffer 
with 0.5 mM EDTA. The Gel^c**, analog (4.2 mg/ml) was 
treated with 0.1 mM DTT for 30 minutes in 0.1 M 

20 Naphosphate, 0.1 M NaCl, pH 7.5 buffer with 0.5 mM EDTA. 

Lastly, the Gelc 10 variant (3.1 mg/ml) was treated with 1 mM 
DTT for 20 minutes in 0.1 M Naphosphate, 0.1 M NaCl, pH 7.5 
buffer with 1 mM EDTA. 

The presence of a free sulfhydryl was verified by 

25 reaction with DTNB and the average value obtained was 1.4 
± 0.65 SH/molecule. No free thiols were detected in the 
absence of reduction. 

H65 antibody and chimeric H65 antibody were 
chemically modified with the hindered linker 5-methyl-2- 

30 iminothiolane (M2IT) at lysine residues to introduce a 

reactive sulfhydryl group as described in Goff et al., 
Bioconjugate Chem., 1:381-386 (1990) and co-owned Carroll 
et al., U.S. Patent No. 5,093,475, incorporated by 
reference herein. 

35 Specifically, for conjugation with Gelc24s and 

Gelc^, murine H65 antibody at 4 mg/mL was derivitized with 
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18x M2IT and 2.5 mM DTNB in 25 mM TEOA, 150 mM NaCl, pH 8 
buffer for 1 hour at 23 *C. The reaction gave 1.9 linkers 
per antibody as determined by DTNB assay. 

For conjugation with Gel C247 and Gel C239 , H65 
5 antibody at 4.7 mg/mL was derivitized with 20x M2IT and 2.5 
mM DTNB in 25 mM TEOA 150 mM NaCl, pH 8 buffer for 50 
minutes at 23 *C. The reaction gave 1.6 linkers per 
antibody as determined by DTNB assay * 

Before reaction with Gel^cc**), H55 antibody at 5.8 
10 mg/mL was derivitized with 20x m2IT and 2.5 mM DTNB in 25 
mM TEOA, 150 mM NaCl, pH 8 buffer for 30 minutes at 23 *C. 
The reaction gave 1.5 linkers per antibody as determined by 
DTNB assay. 

For conjugation with Gel cl0 , H65 antibody at 2.2 
15 mg/mL was derivitized with lOx m2IT and 2.5 mM DTNB in 25 

mM TEOA, 150 mM NaCl, pH 8 buffer for 1 hour at 23 # C. The 
reaction gave 1.4 linkers per antibody as determined by 
DTNB assay. 

Chimeric H65 antibody was prepared for 
20 conjugation in a similar manner to murine H65 antibody. 

Two methods were initially compared for their 
effectiveness in preparing immunoconjugates with 
recombinant gel on in. First, the native disulfide bond in 
recombinant gelonin was reduced by the addition of 2mM DTT 
25 at room temperature for 30 minutes. The reduced gelonin 
was recovered by size-exclusion chromatography on a column 
of Sephadex GF-05LS and assayed for the presence of free 
sulfhydryls by the DTNB assay. 1.4 free SH groups were 
detected* This reduced gelonin was then reacted with H65- 
30 (M2IT) -S-S-TNB (1.8 TNB groups/H65) . Under these 

experimental conditions, little or no conjugate was 
prepared between reduced gelonin and thiol-activated H65 
antibody • 

In contrast, when both the recombinant gelonin 
35 and the H65 antibody were first derivitized with the 
crosslinker M2IT (creating gelonin- (M2IT) -SH and H65- 
(M21T)-S-S-TNB) and then mixed together, H65-(M2IT)-S-S- 
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(M2IT) -gelonin conjugate was prepared in good yield 
(toxin/ antibody ratio of 1.6). The starting materials for 
this conjugation (gelonin- (M2IT)-SH and H65-(M2IT) -S-S-TNB) 
contained linker /protein ratios of 1.2 and 1.4, 
5 respectively. Native gelonin was derivatized in a similar 

manner prior to conjugation to murine or chimeric H65 
antibody. 

The reduced gelonin analogs were mixed with H65- 
(M2 IT) -S-S-TNB to allow conjugation* The following 

10 conjugation reactions were set up for each analog: 23 mg 

(in 7.2 ml) of H65-M2IT-TNB were mixed with a 5-fold molar 
excess of Gel^** (23 mg in 6 ml) for 2 hours at room 
temperature, then for 18 hours overnight at 4*C; 23 mg (in 
7.3 ml) of H65-m2IT-TNB were mixed with a 5-fold molar 

15 excess of Gel C244 (23 mg in 3 ml) for 3 hours at room 

temperature, then for ^18 hours overnight at 4*C; 9 mg (in 
2.8 mL) of H65-m2IT-TNB were mixed with a 5-fold molar 
excess of Gel C2 *7 (9 mg in 2.25 mL) for 2 hours at room 
temperature, then for 5 nights at 4*C; 9 mg (in 2.8 mL) of 

20 H65-m2IT-TNB were mixed with a 5-fold molar excess of Gel C23 , 
(9mg in 2.6 mL) for 2 hours at room temperature, then at 
4*c for 3 days; 12 mg (in 1.9 mL) of H65-m2IT-TNB were 
mixed with a 5.6-fold molar excess of Gel^c^, (13.44 mg in 
3.2 mL) for 4*5 hours at room temperature, then 4"C 

25 overnight; and 11 mg of H65-m2IT-TNB were mixed with a 5- 

fold molar excess of Gelc 10 (11 mg in 3.5 mL) for 4 hours at 
room temperature, then at 4'C overnight. 

Following conjugation, unreacted M2IT linkers on 
the antibody were quenched with 1:1 mole cysteamine to 

3 0 linker for 15 minutes at room temper ature. The quenched 
reaction solution was then loaded onto a gel filtration 
column [Sephadex G-150 (Pharmacia) in the case of Gel C2 4«, 
Gel^, Gelc2 44 and Gel^, and an AcA-44 column (IBF 
Biotecnics, France) in the case of Gel^otc**) and Gel^] . The 

35 reactions were run over the gel filtration columns and 
eluted with 10 mM Tris, 0.15M NaCl pH 7. The first peak 
off each column was loaded onto Blue Toyopearl* resin 
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(ToSoHaas, Philadelphia, Pennsylvania) in 10 mM Tris, 3 0 mM 
NaCl, pH 7 and the product was eluted with 10 mM Tris, 0.5 M 
5 NaCl, pH 7.5. 

Samples of the final conjugation products were run on 
5% non-reduced SDS PAGE, Coomassie stained and scanned with a 
Shimadzu laser densitometer to quantitate the number of toxins 
per antibody (T/A ratio) . The yield of final product for each 

10 analog conjugate was as follows: Gel C 248# 17 mg with a T/A ration 
of 1.6; Gel C 247/ 1 - 1 nvg with a T/A ratio of 1; Gel c24 4, 4 . 5 mgs 
with a T/A ratio of 1.46; Gel c2 39/ 2 . 9 mg with a T/A ratio of 
2.4; Gel A50 (c44)/ 7.3 mg with a T/A ratio of 1.22; and Gel cl0/ 6.2 
mg with a T/A ratio of 1.37. Conjugation efficiency (i.e., 

15 conversion of free antibody to immunoconjugate) was 
significantly greater (-80%) for some analogs (Gel C i 0 # Gel A50 (c44)/ 
Gel C2 39/ Gel c2 47, and Gel C 24s) than for others (-10%, Gel C2 4 4 ) . 

B. Gelonin Immunoconjugates With 
20 Chimeric And Humanized Antibodies 

Analogs Gel C 24?/ and Gel A 50<c44)# were also conjugated to 

various chimeric [cH65Fab, cH65Fab» and cH65F (ab , ) 2 i/ and 

"human engineered" [hel Fab, he2-Fab, he3-Fab, hel Fab 1 and hel 

F (ab 1 ) 2] / antibody fragments . Chimeric H65 antibody fragments 

25 may be prepared according to the methods described in 
International Publication No. WO 89/00999, supra. The DNA 
sequences encoding the variable regions of H65 antibody 
fragments that were human engineered (referring to the 
replacement of selected murine-encoded amino acids to make the 

30 H65 antibody sequences less immunogenic to humans) according to 
the methods described above in Example 5, are set out in SEQ ID 
NO: 69 (variable region of the kappa chain of hel and he2) , SEQ 
ID NO: 70 (variable region of the gamma chain of hel)/ SEQ ID 
NO: 71 (variable region of the gamma chain of he2 and he3) and 

35 SEQ ID NO: 72 (variable region of the kappa chain of he3) . 

The chimeric H65 antibody fragments were conjugated 
to the Gel c247 analog in the same manner as 
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described below for conjugation of human engineered Fab and 
Fab* fragments to Gel C2 < 7 and Gel A30(C * M . 



The hel Fab was dialyzed into 25 mM TEOA buffer, 
5 250 mM NaCI, pH 8 and then concentrated to 6.8 mg/mL prior 

to derivitization with the M2IT crosslinker. For the 
linker reaction, M2IT was used at 20-fold molar excess, in 
the presence of 2.5 mM DTNB. The reaction was allowed to 
proceed for 30 minutes at room temperature, then desalted 

10 on GF05 (gel filtration resin) and equilibrated in 0*1 M Na 

phosphate, 0.2M NaCI, pH 7.5. A linker number of 1.8 
linkers per Fab was calculated based on the DTNB assay. 
The hel Fab-M2 IT-TNB was concentrated to 3.7 mg/mL prior to 
conjugation with Gel^*?. 

15 Gel C247 at 12.8 mg/mL in 10 mM Na phosphate, 0.3M 

NaCI, was treated with 1 mM DTT, 0.5 mM EDTA for 20 minutes 
at room temperature to expose a reactive sulfhydryl for 
conjugation and then was desalted on GF05 and equilibrated 
in 0.1 M Na phosphate, 0.2 M NaCI, pH 7.5. Free thiol 

20 content was determined to be 0.74 moles of free SH per mole 

of Gel c2 * 7 using the DTNB assay. The gelonin was 
concentrated to 8.3 mg/mL prior to conjugation with 
activated antibody. 

The conjugation reaction between the free thiol 

25 on Gelc2*7 and the derivitized hel Fab-M2IT-TNB, conditions 
were as follows. A 5-fold excess of the gelonin analog was 
added to activated hel Fab— M2 IT-TNB (both proteins were in 
0*1M Na phosphate, 0.2M NaCI, pH7.5) and the reaction 
mixture was incubated for 3.5 hours at room temperature and 

30 then overnight at 4*C. Following conjugation, untreated 
M2IT linkers were quenched with 1:1 mole cysteamine to 
linker for 15 minutes at room temperature. The quenched 
reaction solution was loaded onto a gel filtration column 
(G-75) equilibrated with 10 mM Tris, 150 mM NaCI, pH 7. 

35 The first peak off this column was diluted to 30 mM NaCI 
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with 10 mM Tris, pH7 and loaded on Blue Toyopearl®. The 
product was eluted with 10 mM Tris, 0.5 M NaCl, pH 7.5. 



(ii) tel T$b'-Gel rZh7 

Similarly, the H65 hel Fab' fragment was dialyzed 
5 into 25 mM TEOA buffer, 400 mM NaCl, pH 8 at 2.9 mg/mL 

prior to deriviti2ation with the M2IT crosslinker. For the 
linker reaction, M2IT was used at 20-fold molar excess, in 
the presence of 2.5 mM DTNB. The reaction was allowed to 
proceed for 1 hour at room temperature then it was desalted 

10 on GF05 (gel filtration resin) and equilibrated in 0.1 M Na 

phosphate, 0.2 M NaCl, pH 7.5. A linker number of 1.6 
linkers per Fab 1 was calculated based on the DTNB assay. 
The hel Fab 1 -M2 IT-TNB was concentrated to 3.7 mg/mL prior 
to conjugation with Gelcm 

15 The Geica*? at 77 mg/mL was diluted with 10 mM Na 

phosphate, 0.1 M NaCl to a concentration of 5 mg/mL, 
treated with 1 mM DTT, 0.5 mM EDTA for 30 minutes at room 
temperature to expose a free thiol for conjugation and then 
was desalted on 6F05 and equilibrated in 0.1 M Na 

20 phosphate, 0.2 M NaCl, pH 7.5. Free thiol content was 

determined to be 1.48 moles of free SH per mole of Gel C247 
using the DTNB assay. The Gel^^ was concentrated to 10 
mg/mL prior to conjugation with activated hel Fab f -M2IT- 
TNB. 

25 For the reaction between the free thiol on Gel C247 

and the derivitized hel Fab 9 -M2IT-TNB , conditions were as 
follows. A 5.7-fold molar excess of gelonin was added to 
activated hel Fab 1 -M2 IT-TNB and the final salt 
concentration was adjusted to 0.25 M. The reaction mix was 

3 0 incubated for 1.5 hours at room temperature and then over 

the weekend at 4*C. Following conjugation, unreacted M2IT 
linkers were quenched with 1:1 mole cysteamine to linker 
for 15 minutes at room temperature. The quenched reaction 
solution was loaded onto a gel filtration column (AcA54) 

35 equilibrated with 10 mM Tris, 250 mM NaCl, pH 7.5, The 
first peak off this column was diluted to 20 mM NaCl with 
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10 mM Tris, pH 7 and loaded on Blue Toyopearl* which was 
equilibrated in 10 mM Tris, 20 mM NaCl, pH 7. The column 
was then washed with 10 mM Tris f 30 mM Nacl f pH 7.5. The 
product was eluted with 10 mM Tris, l M NaCl, pH 7.5. 

5 (iii) ha2-Fab Gel,. v -> M 

The he2-Fab was dialyzed overnight into 25 mM 
TEOA, 0.25 M NaCl, pH 8 buffer and then concentrated to 
13.3 mg/mL prior to derivitization with the M2IT 
cross linker . For the linker reaction , M2IT was used in a 

10 20-fold molar excess in the presence of 2.5 mM DTNB. The 

reaction was allowed to proceed for 20 minutes at room 
temperature and was then desalted on a GF05-LS (gel 
filtration) column, equilibrated in 0.1 M Na phosphate, 0.2 
M NaCl with 0.02% Na azide. A linker number of 1.7 linkers 

15 per Fab-M2 IT-TNB was calculated based on the DTNB assay. 

After derivitization and gel filtration, the he2-Fab 
concentration was 5.2 mg/mL. 

Ge 1^0(04*) at 8.33 mg/mL in 10 mM Na phosphate, pH 
7.2 was treated with 5 mM DTT and 0.5 mM EDTA for 30 

20 minutes at room temperature to expose a reactive thiol for 
conjugation and then was desalted on GF05-LS resin 
equilibrated in 0.1 M Na phosphate, 0.1 M NaCl with 0.5 mM 
EDTA plus 0.02% Na azide, pH 7.5. Free thiol content was 
determined to be 0.83 moles of free SH per mole of Gel^o^, 

25 using the DTNB assay. The gelonin was concentrated to 11.4 

mg/mL prior to conjugation with activated he2-Fab. 

The conjugation reaction conditions between the 
free thiol on Gel A50( c44> and the derivitized he2-Fab-M2IT-TNB 
were as follows* A 3-fold excess of the gelonin analog was 

30 added to activated he2 -Fab-M2 IT-TNB (both proteins were in 

0.1 M Na phosphate, 0.1 M NaCl, pH 7.5 but the gelonin 
solution contained 0.5 mM EDTA as well). The reaction 
mixture was concentrated to half its original volume, then 
the mixture was incubated for 4 hours at room temperature 

3 5 followed by 72 hours at 4 # C. Following the incubation 



WO 94/26910 



PCT/US94/05348 



-70- 

period the efficiency of conjugation was estimated at 7 0- 
75% by examination of SDS PAGE. 

Following conjugation the excess M2IT linkers 
were quenched by incubation with 1:1 mole cysteamine to 
5 linker for 15 minutes at room temperature. The quenched 

reaction as loaded onto a gel filtration column (G-75) 
equilibrated in 10 mM Tris, 0.15 M NaCl, pH 7. The first 
peak off this column was diluted to 30 mM NaCl with 10 mM 
Tris, pH 7 and loaded onto a Blue Toyopearl* (TosoHaas) 
10 column. The product was eluted with 10 mM Tris, 1 M NaCl, 
pH 7.5. 



(iv) hj?-fafc Ggl^ 0(C44) 

Similarly, the he3-Fab was dialyzed overnight 
into 25 mM TEOA, 0.25 M NaCl, pH 8 buffer and then 

15 concentrated to 5 mg/mL prior to derivitization with the 

M2IT cross linker. For the linker reaction, M2IT was used 
in a 10-fold molar excess in the presence of 2.5 mM DTNB. 
The reaction was allowed to proceed for 45 minutes at room 
temperature and was then desalted on a GF05-LS (gel 

20 filtration) column, equilibrated in 0.1 M Na phosphate, 0.2 

M NaCl with 0.02% Na azide* A linker number of 1 M2IT per 
Fab-M2IT-TNB was calculated based on the DTNB assay. After 
derivitization and gel filtration, the he3-Fab 
concentration was 5*3 mg/mL, 

25 G*1am<c44) at 7.8 mg/mL in 0.1 M Na phosphate, 0.1 

M NaCl, pH 7.5 was treated with 1.5 mM DTT and 1 mM EDTA 
for 30 minutes at room temperature to expose a reactive 
thiol for conjugation and then was desalted on GF05-LS 
resin equilibrated in 0.1 M Na phosphate, o.l M NaCl plus 

3 0 0.02% Na azide, pH 7.5. Free thiol content was determined 

to be 0.66 moles of free SH per mole of Gel^,,^, using the 
DTNB assay. The gelonin was concentrated to 5.2 mg/mL 
prior to conjugation with activated he3-Fab. 

The conjugation reaction conditions between the 

35 free thiol on Gel^^ and the derivitized he3-Fab-M2IT-TNB 
were as follows. A 5-fold excess of the gelonin analog was 
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added to activated he3-Fab-M2IT-TNB (both proteins were in 
0.1 M Na phosphate 0.1 M NaCl, pH 7.5). The reaction 
mixture was incubated for 2 hours at room temperature 
followed by 72 hours at 4*C. Following the incubated 
5 period the efficiency of conjugation was estimated at 70- 

75% by examination of SDS PAGE. 

Following conjugation, the excess M2IT linkers 
were quenched by incubation with 1:1 mole cysteamine to 
linker for 15 minutes at room temperature. The quenched 

10 reaction was loaded onto a GammaBind G (immobilized protein 

G affinity resin f obtained from Genex, Gaithersburg, 
Maryland) equilibrated in 10 mM Na phosphate, 0.15 M Naci, 
pH 7. It was eluted with 0.5 M NaOAc, pH 3 and neutralized 
with Tris. It was dialyzed into 10 mM Tris, 0.15 M NaCl, 

15 pH 7 overnight, then diluted to 30 mM NaCl with 10 mM Tris, 

pH 7 and loaded onto a blue Toyopearl* (TosoHaas) column. 
The product was eluted with 10 mM Tris, 1 M NaCl, pH 7.5 

Example a 

Whole Cell Kill Assays 

20 Immunoconjugates prepared with gelonin and 

gelonin analogs were tested for cytotoxicity against an 
acute lymphoblastoid leukemia T cell line (HSB2 cells) and 
against human peripheral blood mononuclear cells (PBMCs) . 
Immunoconjugates of ricin A-chain with H65 antibody (H65- 

25 RTA) and antibody fragments were also tested. The ricin A- 

chain (RTA) as well as the H65-RTA immunoconjugates were 
prepared and purified according to methods described in 
U.S. Patent Application Serial No. 07/306,433, supra and in 
International Publication No. WO 89/06968, supra. 

30 Briefly, HSB2 cells were incubated with 

immunotoxin and the inhibition of protein synthesis in the 
presence of immunotoxin was measured relative to untreated 
control cells. The standard immunoconjugates H65-RTA (H65 
derivitized with SPDP linked to RTA) , H65-Gelonin and H65- 

35 rGelonin, H65 fragment immunocon jugate, and gelonin 

immunocon jugate samples were diluted with RPMI without 
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leucine at half -log concentrations ranging from 2 000 to 
0.632 ng/ml. All dilutions were added in triplicate to 
wells of microtiter plates containing l x 10 3 HSB2 cells per 
well. HSB2 plates were incubated for 20 hours at 37 # c and 
5 then pulsed with 3 H-Leu for 4 hours before harvesting. 

Samples were counted on the Inotec Trace 96 cascade 
ionization counter- By comparison with an untreated 
sample, the picomolar concentration (pM) of immunotoxin 
which resulted in a 50% inhibition of protein synthesis 

10 (IC 30 ) was calculated. In order to normalize for conjugates 

containing differing amounts of toxin or toxin analog, the 
cytotoxicity data were converted to picomolar toxin (pM T) 
by multiplying the conjugate IC 50 (in pM) by the 
toxin/ antibody ratio which is unique to each conjugate 

15 preparation . 

The PMBC assays were performed as described by 
Fishwild et al., Clin, and Exp. Immunol., 85:506-513 (1991) 
and involved the incubation of immunoconjugates with PBMCs 
for a total of 90 hours* During the final 16 hours of 

20 incubation, 3 H-thymidine was added; upon completion, 

immunocon jugate- induced inhibition of ONA synthesis was 
quantified. The activities of the H65 and chimeric H65 
antibody conjugates against HSB2 cells and PBMC cells are 
listed in Table. 2 below. 
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Table 2 

IC 50 (pM T) 

Conjugate H5B2 Call* 

H65-RTA 143 

5 H65- (M2IT) -S-S- (M2IT) -Gelonin 1770 

H65- (M2IT) -S-S- (M2IT) -rGelonin 276 

H65- (M2IT) -S-S-Gel cl0 140 
H65- (M2IT) -S-S-Gel^o^, 99 51 

H65- (M2IT) -S-S-Gel^, 2328 

10 H65-(M2IT)-S-S-Gel C2M >5000 

H65- (M2IT) -S-S-Gel^ 41 

H65-(M2IT)-S-S-Gelc2 4a 440 

cH65-RTA 30 60 

CH65- (M2IT) -S-S- (M2IT) -Gelonin 1770 

15 cH65- (M2 IT) -S-S- (M2IT) -rGelonin 153 

CH65-(M2IT)-S-S-Gel ca3 , >7000 

CH65- (M2IT) -S-S-Gelc2 47 34 

CH65- (M2IT) -S-S-Gel^ 238 
H65-(M2IT)-S-S-Gel AM(C30) 338 ND* 

20 H65- (M2IT) -S-S-Gel^,^^ 71 ND* 

~ Not determined. 



PSHCs 

459 

81 

75 

28 

180 

>2700 

35 

203 

400 

140 

120 

290 

60 

860 



Against HSB2 cells, many of the gelonin analog 
immunoconjugates were significantly more potent than 
conjugates prepared with native gelonin or recombinant, 

25 unmodified gelonin, both in terms of a low IC 30 value, but 
also in terms of a greater extent of cell kill. Against 
human PBMCs, the gelonin analog conjugates were at least as 
active as native and recombinant gelonin conjugates. 
Importantly, however, some of the conjugates (for example, 

30 Gelcio* ^^0(0*4) and Gel^y) exhibited an enhanced potency 
against PBMCs compared to native and recombinant gelonin 
conjugates, and also exhibited an enhanced level of cell 
kill. 

The activities of the H65 antibody fragment 
35 conjugates against HSB2 cells and PBMC cells are listed in 
Tables 3 and 4 below, wherein extent of kill in Table 3 
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refers to the percentage of protein synthesis inhibited in 
HSB2 cells at the highest immunotoxin concentration tested 
(1 Mg/ml) . 

Table 3 

5 IC 50 (pM T) 



conjuqate H?g2 


Cells 


PBMCS 


cH65Fab 9 -RTA 3 0 


530 


1800 


cH65Fab 1 -rGelonin 


135 


160 


cHSSFab'-Gelc^ 


48 


64 


cH65F(ab' ) 2 -RTA 30 


33 


57 


cH65F(ab* ) 2 -rGelonin 


55 


34 


cH65F(ab')2-<3el C247 


23 


20 


cHSSFCab'Jj-Gel^a 


181 


95 



15 IC 30 (pM T) 



Conjugate HSB2 Cells Extent of Kill 

hel Fab'-Gel^y 57.7 93% 

hel Fab-Gelcj^ 180.0 94% 

he2-Fab-Gel AS0 (C4*> 363.0 91% 

20 heS-Fab-Gel^occ^) 191.0 93% 

cHeSFab'-Gelca^ 47.5 93% 

CH65F ( ab ■ ) 2 -rGelonin 45.4 85% 

cH65F(ab») 2 -Gelc247 77.5 83% 

cH65F (ab • ) 2 -Galca47 23.2 85% 



25 The data in Table 3 show that monovalent (Fab or 

Fab') fragments conjugated to various forms of gelonin are 
more potent than RTA conjugates. Table 4 shows that the 
human-engineered gelonin-Fab conjugates exhibit a very high 
degree of extent of kill. 
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Exaarole 9 

Properties Of Gelonin T^unoconiuaatie^ 

Recombinant gelonin and the gelonin analogs 
5 exhibited enhanced solubility in comparison to both native 

gelonin and RTA3 0 « In addition, recombinant gelonin and 
gelonin analog immunoconjugates exhibited enhanced 
solubility relative to immunoconjugates prepared with 
native gelonin and RTA30. This enhanced solubility was 
10 particularly noteworthy for recombinant gelonin and analog 

conjugates prepared with chimeric Fab fragments* 

B. Disulfide Bond Stability Assay 

The stability of the disulfide bond linking a RIP 
to a targeting molecule (such as an antibody) is known to 

15 influence the lifespan of immunoconjugates in vivo [See 

Thorpe et al., Csuxcer Res., 47:5924-5931 (1987), 
incorporated by reference herein]. For example, conjugates 
in which the disulfide bond is easily broken by reduction 
in vitro are less stable and less efficacious in animal 

20 models [See Thorpe et al., Cancer Res., 48:6396-6403 

(1988), incorporated by reference herein]. 

Immunoconjugates prepared with native gelonin, 
recombinant gelonin and gelonin analogs were therefore 
examined in an in vitro disulfide bond stability assay 

25 similar to that described in Wawrzynczak et al., Cancer 

Res., 50:7519-7526 (1990), incorporated by reference 
herein. Conjugates were incubated with increasing 
concentrations of glutathione for 1 hour at 37 *C and, after 
terminating the reaction with iodoacetamide, the amount of 

30 RIP released was guantitated by size-exclusion HPLC on a 
TosoHaas TSK-G2000SW column. 

By comparison with the amount of RIP released by 
high concentrations of 2-mercaptoethanol (to determine 100% 
release) , the concentration of glutathione required to 

35 release 50% of the RIP (the RC 50 ) was calculated. The 
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results of assays for H65 antibody conjugates are set out 
in Table 5 below. 



Table 5 

conjugate RC«„ mm 

H65-RTA 30 3<2 

H65-(K2IT) -S-S- (M2IT) -gelonin n.i 

H65- (M2IT) -S-S- (M2IT) -rGelonin 3 . 0 

H65- (M2IT) -S-S-Gel cao 2 . 5 

H65-(M2IT)-S-S-Gel A30 (c**) 0.6 

H65- (M2IT) -S-S-Gel C23 , 774 . 0 

H65- (M2IT) -S-S-Gel^ 1.2 

H65-(M2IT) -S-S-Gel C247 0 . 1 

HS5-(M2IT)-S-S-Gel C24B 0.4 

CH65-RTA 30 2 . 50 

CH65- (M2IT) -S-S- (M2IT) -rGelonin 2.39 

cH65-(M2IT)-S-S-Gel C247 0.11 

CH65-(M2IT)-S-S-Gel c24 , 0.32 

H65- (M2IT) -S-S-Gel A „ (C30) 9 . 2 

H65- (M2IT) "S-S-Gelc^^o 0 . 3 



The foregoing results indicate that the stability of the 
bonds between the different gelonin proteins and H65 
antibody varied greatly, with the exception of Gelc 10 and 
Gel cza»» most of the gelonin analogs resulted in conjugates 
with linkages that were somewhat less stable in the in 
vitro assay than the dual-linker chemical conjugate. The 
stability of the Gel^, analog, however, was particularly 
enhanced . 

The results of the assay for H65 antibody 
fragment conjugates are set out in Table 6 below. 
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Tabla 6 

Conjugate RC M (wM) 

hel Fab » -Gel C24 » 0.07 
cFab r -Gelonin 1 . 27 

5 cFab'-Gel C2 ^ 7 0,08 

cFfabMa-RTA 30 1.74 
cF ( ab 1 ) 2 -rGelonin 2.30 
cF ( ab » ) z-Gelc2* 7 0.09 
cF(ab« >2-<5elc2*e 0.32 
10 he2-Fab-Gel A5fl(CM) 0.46 

heS-Fab-Gel^^, 0 . 58 



From the RC 50 results presented in Tables 5 and 6, 
it appears that the particular RIP analog component of each 
immunotoxin dictates the stability of the immunotoxin 
15 disulfide bond In vitro* 

Pharmacokinetics Of Conjugates To H65 Antibody 

The pharmacokinetics of gelonin analogs Gel^*?, 
G elAso<c*4)» and Gelc 10 linked to whole H65 antibody was 

20 investigated in rats. An IV bolus of 0.1 mg/kg of X25 I- 

labelled immunocon jugate H65-(M2IT) -S-S-Gelc a47 , H65-(M2IT)- 
S-S-Geljuo^, or H65-(M2IT)-S-S-Gelcio was administered to 
male Sprague-Dawley rats weighing 134-148 grams. Serum 
samples were collected from the rats at 3, 15, 30 and 45 

25 minutes, and at 1.5, 2, 4, 6, 8, 18, 24, 48, 72, and 96 
hours. Radioactivity (cpa/ml) of each sample was measured, 
and SOS-PAGE was performed to determine the fraction of 
radioactivity associated with whole immunoconjugate. 
Immunoconjugate-associated serum radioactivity was analyzed 

30 using the computer program PCNONLIN (SCI Software, 
Lexington, Kentucky) . Table 7 below lists the 

pharmacokinetic parameters of the immunocon jugates. In 
that table, the standard error for each value is indicated 
and a one way analysis of variance is presented, IC is the 

35 immunoconjugate (specified by the abbreviation for the 
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gelonin variant that is part of the immunoconjugate) , n is 
the number of animals in the study, Vc is the central 
volume of distribution, CI is the clearance, MRT is the 
total body mean residence time, Alpha is the a half -life 
5 and Beta is the 0 half -life of the immunoconjugate. 
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The Gel C247 immunoconjugate was found to have a and 
0 half lives of 2.3 and 20 hours, with a total mean 
residence time of 17 hours. The 72 end 96 hour time points 
were excluded from analysis because of the poor resolution 
5 of immunocon jugate associated radioactivity on the SDS-PAGE 

gel for these serum samples. 

Because in vitro studies suggested that the Gel cla 
immunocon jugate had greater disulfide bond stability it 
was anticipated that its half lives in vivo would be longer 

10 relative to the cys 247 form of the immunoconjugate. The 0 

half life of the immunoconjugate was about 33 hours 
compared to 20 hours for the Gelc^ conjugate. The total 
mean residence time was also much greater for the Gel^ 
immunoconjugate (42 hours versus 42 hours for the Gel 247 

15 conjugate) . In addition, the clearance of the Gel ci0 

immunoconjugate was 2.5 ml/hr/kg, about four times less 
than that of the Gel^^ immunoconjugate (11 ml/hr/kg) . As 
also predicted from the in vitro disulfide stability data, 
the clearance of the Gs1aso<c**) immunoconjugate was 

20 intermediate between those of the Gelc 1Q and Gel C247 

immunocon jugates . 

Based on these studies, the Gel cl0 analog 
conjugated to H65 antibody has greater in vivo stability 
than the Gel A50(C 4 4} and Gel C247 analogs conjugated to H65 

25 antibody (as determined by the longer mean residence time 

and clearance rates) , although the properties of the 
Gel^occ**, immunoconjugate more closely resembled those of the 
Geldo immunoconjugate than the Gel C247 immunoconjugate. 



Simple U 

30 Pharmacokinetics Of Conjugates To H65 Antibody Fragments 

The pharmacokinetics of Gel^*? and Gel^^M 
analogs linked to human engineered H65 Fab fragments were 
also investigated in rats. An IV bolus of 0.1 mg/kg of 125 1- 
labelled hel H65 Fab-Gel C247 , he2 H65 Fab-Gel^c^, or he3 H65 

35 Fab-Gel^occM) was administered to male Sprague-Dawley rats 
weighing 150-180 grams* Serum samples were collected at 3, 
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5, 15, 20, 30, and 40 minutes, and 1, 1.5, 3, 6, 8, 18, 24, 
32, 48, and 72 hours, and were analyzed by ELISA using 
rabbit anti-Gelonin antibody as the capture antibody and 
biotin-labelled goat anti-human kappa light chain antibody 
5 as the secondary antibody. Results of the analysis are 

presented in Table 8 below. In the table, the standard 
error for each value is shown, and IC is the 
immunoconjugate, n is the number of animals in the study, 
Vc is the central volume of distribution, Vss is the steady 
10 state volume of distribution, CI is the clearance, MRT is 

the total body mean residence time, Alpha is the a half* 
life and Beta is the 0 half-life of the indicated 
conjugate. 
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Comparing the three immunocon jugates, the 
pharmacokinetics of hel H65 Fab-Gel C247 , he2 H65 Fab-Gel M0tC44) 
and he3 -Fab-Gel^ tC4 «) were very similar, having similar alpha 
and beta half -lives, mean residence times, and clearance, 
5 particularly when comparing parameters obtained from the 

ELISA assayed curves . This is in contrast to their whole 
antibody immunocon jugate counterparts, where the clearance 
of Gel C24 7 immunocon jugate (11 ml/kg/hr) was three-fold 
greater than that of Geljuocc**) immunocon jugate (4 ml/kg/hr) . 
10 This suggests that cleavage of the disulfide bond linking 
the Fab fragment and gelonin is not as important for the 
serum clearance of Fab immunocon jugates as for whole 
antibody immunocon jugates. 



Example 12 

15 Immunoaenicitv Of Immunoconiuaates 

Outbred Swiss /Webster mice were injected 
repeatedly (0.2 mg/kg each injection) with murine H65 
antibody conjugates prepared with RTA, RTA30 and 
recombinant gelonin. The cycle was such that each animal 

20 was injected on days 1 and 2, and then the injections were 
repeated 28 and 29 days later. The animals received 5 such 
cycles of injections. One week and three weeks following 
each series of injections, blood was collected and the 
amount of anti-RIP antibodies present was determined by 

25 ELISA; peak titers for each cycle are shown in Table 9. 

RTA and RTA30 generated strong responses which began 
immediately following the first cycle of injections and 
remained high throughout the experiment. In contrast, no 
immune response was detected for the gelonin conjugate, 

30 even after 5 cycles of injections. When the conjugates 

were mixed with Complete Freund Adjuvant and injected i.p. 
into mice, anti-RTA and RTA-30 antibodies were readily 
detected after several weeks. These data indicate that 
anti-gelonin antibodies, if generated, would have been 

35 detected by the ELISA assay, and suggest that recombinant 
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gelonin may be much less immunogenic in animals than is 
RTA. 

Table 9 



Cvcle H65-RTA H65-RTA30 H65-rGel 

5 Prebleed 100 100 100 

Cycle 1 168 117 100 

Cycle 2 4208 1008 100 

Cycle 3 7468 3586 100 

Cycle 4 5707 3936 100 

10 Cycle 5 4042 2505 100 



Example 13 

In vivo Efficacy Of Immunoconiuaates 

A human peripheral blood lymphocyte (PBL) - 
reconstituted, severe combined immunodef icient mouse model 
15 was utilized to evaluate the in vivo efficacy of various 
immunoconjugates comprising the gelonin analogs Gel^^ and 
Geljuaccw Immunoconjugates were tested for the capacity to 
deplete human blood cells expressing the C05 antigen* 



A. Human PBL Donors And Cell Isolation 

20 Human peripheral blood cells were obtained from 

lymphapheresis samples (HemaCare Corporation, Sherman Oaks, 
CA) or venous blood samples (Stanford University Blood 
Bank, Palo Alto, CA) collected from healthy donors. Blood 
cells were enriched for PBLs using Ficoll-Hypaque density 

25 gradient centrifugation (Ficoll-Paque*; Pharmacia, 
Piscataway, New Jersey) and subsequently washed 4 times 
with PBS. Residual erythrocytes were lysed with RBC lysing 
buffer (16 MM ammonium chloride, 1 mM potassium 
bicarbonate, 12.5 EDTA) during the second wash. Cell 

30 viability in the final suspension was >95% as assessed by 

trypan blue dye exclusion. 
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B. Animals And Human PBL T ransfer 

CB.17 scld/scid (SCID) mice were purchased from 
Taconic (Germantovn, New York) or were bred under sterile 
conditions in a specific pathogen-free animal facility 
5 (original breeding pairs were obtained from Hana Biologies, 

Alameda, California) . Animals were housed in filter-top 
cages and were not administered prophylactic antibiotic 
treatment* Cages, bedding, food and water were autoclaved 
before use. All manipulations with animals were performed 

10 in a laminar flow hood. 

Untreated SCID mice were bled for determination 
of mouse Ig levels. Human PBL- injected mice were bled at 
various intervals for quantitation of human Ig and SIL-2R. 
Blood collection was from the retro-orbital sinus into 

15 heparinized tubes. Blood samples were centrifuged at 3 00 

x g for 10 min, and plasma was collected and stored at 
-70 "C. Mouse and human Ig were quantified using standard 
sandwich ELI S As. Briefly, flat-bottom microtiter plates 
(MaxiSorp Immuno-Plates, Nunc, Roskilde, Denmark) were 

20 coated overnight at 4*C with goat anti-mouse IgG+IgA+IgM 

(Zymed Laboratories, Inc., South San Francisco, California) 
or goat anti-human Igs (Tago, Inc., Burlingame, California) 
in bicarbonate buffer, pH 9.6. Plates were blocked for 2 
hours at room temperature with 1% BSA in Tris-buf f ered 

25 saline, pH 7.5 (TBS), and then incubated at 37 'C for 1 hour 

with standards or samples serially-diluted in TBS/1% 
BSA/0. 05% Tween 20. Standards used were a monoclonal mouse 
Ig62a (IND1 anti-melanoma; XOMA Corporation, Berkeley, 
California) and polyclonal human ig (Sigma Chemical Co., 

3 0 St. Louis, Missouri) . Subsequently, plates were washed 
with TBS/Tween 20 and incubated at 37 9 C for 1 hour with 
alkaline phosphatase-conjugated goat anti-mouse IgG+IgA+IgM 
or goat anti-human Igs (Caltag Laboratories, South San 
Francisco, California) . Detection was by measurement of 

35 absorbanc€u at 405 nm following incubation with 1 mg/ml p- 
nitro-phenylphosphate (Sigma) in 10% diethanolamine buffer, 
pH 9.8. Plasma from a normal BALB/c mouse was used as a 
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positive control in the mouse Ig ELISA. Plasma samples 
from naive SCID mice or normal BALB/c mice did not have 
detectable levels of human Ig, Human sIL-2R was quantified 
using an ELISA kit (Immunotech S.A., Marseille, France) as 
5 per the manufacturer's instructions. 

Five-to-seven week old mice with low plasma 
levels of mouse Ig (<10ug/ml) were preconditioned with an 
i.p. injection of cyclophosphamide (Sigma) at 200 mg/kg. 
Two days later, they were injected i.p. with 25-40 x 10* 
10 freshly-isolated human PBL suspended in 0.8 ml PBS. 

C. Immunoconiuaate Treatment 

SCID mice were bled at approximately 2 weeks 
after human PBL transplantation. Mice with undetectable 
(<10 pM) or low plasma levels of human sIL-2R were 

15 eliminated from the study. The cut-off for exclusion of 
mice with detectable, but low, levels of human SIL-2R was 
empirically determined for each study and was generally 20 
pM. The remaining mice were divided into groups and were 
administered vehicle or immunocon jugate as an i.v. bolus 

20 (0.2 mg/kg) daily for 5 consecutive days. Animals were 

sacrificed 1 day after cessation of treatment for 
quantitation of human T cells in tissues and human sIL-2R 
in plasma. 

CPllg<?ti<?n Of Tis^es An<j ^Xysjs Qt FPfr Depletion 
25 Blood was collected from the retro-orbital sinus 

into heparinized tubes. Mice were then killed by cervical 
dislocation and spleens were removed aseptically. Single 
cell suspensions of splenocytes were prepared in HBSS by 
pressing the spleens between the frosted ends of sterile 
30 glass microscope slides* Collected cells were washed twice 
with PBS. Erythrocytes were eliminated from blood and 
splenocyte suspensions using RBC lysing buffer. 
Subsequently, cells were resuspended in PBS for 
enumeration. Recovered cells were then assayed for Ag 
35 expression using flow cytometry. 
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Two to five hundred thousand cells in 100 Ml of 
PBS/1% BSA/0.1% sodium azide were incubated on ice for 30 
min. with saturating amounts of various FITC- or 
phycoerythrin (PE) -conjugated Abs (Becton-Dickinson, 
5 Mountain View, CA) Abs used for staining included: HLe-l- 

FITC (IgGl anti-CD45), Leu 2-FITC (IgGl anti-CD8) , Leu 3 PE 
(IgGl anti-CD4), and Leu M3-PE (IgG2a anti-CD14) . Cells 
were then washed in cold buffer and fixed in 0.37% 
formaldehyde in PBS. Samples were analyzed on a FACscan 

10 (Becton-Dickinson) using log amplifiers. Regions to 

quantify positive cells were set based on staining of cells 
obtained from naive SCID mice. The absolute numbers of 
human Ag-positive cells recovered from SCID tissues were 
determined by multiplying the percent positive cells by the 

15 total number of cells recovered from each tissue sample. 

The total number of leukocytes in blood was calculated 
using a theoretical blood volume of 1.4 ml/mouse. The 
detection limit for accurate quantitation of human cells in 
SCID mouse tissues was 0.05%. All statistical comparison 

20 between treatment groups were made using the Mann-Whitney 

U test. Treatment groups were determined to be 
significantly different from buffer control groups when the 
p value was <0.05. Results are presented in Table 10 
below, wherein + indicates a significant difference from 

25 controls, - indicates an insignificant difference and NT 

means the conjugate was not tested. CD5 Plus (XOMA 
Corporation, Berkeley, California) is mouse H65 antibody 
chemically linked to RTA and is a positive control. 0X19 
Fab-Gelc2 47 is a negative control immunoconjugate. The 0X19 

30 antibody (European Collection of Animal Cell cultures 

#84112012) is a mouse anti-rat CDS antibody that does not 
cross react with human CDS. 
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10 



Test Article 

CDS Plus 
CH65 F(ab») 2 
cH65 Fab 1 
H65-rGEL 

cH65 F(ab» ) 2 -rGel 
CH65 Fab'-rGel 
CH65 Ffab'^-Gel^ 
cH65 Fab f -Gel c247 
helH65 Fab'-Gel c247 
CH65 Fab'-Gel^occi*) 
0X19 Fab-Gel c247 



Table 10 

Human T Cell De pletion 



SPleen 



+ 
+ 
+ 



Blood 
+ 



+ 

NT 



NT 



15 All the gelonin immunoconjugates were capable of depleting 

human cells in the SCID mouse model. 



Construction Of Gelonin 

Immunofusions With Chimeric Antibodies 

20 Several genetic constructs were assembled which 

included a natural sequence gelonin gene fused to an H65 
truncated heavy chain gene (Fd or Fd'), or an H65 light 
chain gene (kappa). In this Example , H65 Fd, Fd', and H65 
light chain refer to chimeric constructs. The H65 Fd 

25 sequence consists of the nucleotides encoding the murine 
H65 heavy chain variable (V), joining (J) and human IgGj, 
constant (C) domain 1 regions, including the cysteine bound 
to light chain IgG x and has the carboxyl terminal sequence 
SCDKTHT (SEQ ID NO: 130). The H65 Fd' sequence has the K65 

30 Fd sequence with the addition of the residues CPP from the 

hinge region of human IgGx heavy chain, including a cysteine 
residue which is bound to the other human IgG x heavy chain 
and its F(ab') 2 fragment. See Better, et al., Proc. Nat. 
Acad. Sci. (USA), 90z 457-461 (1993), incorporated by 

35 reference herein. 
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The H65 light chain sequence consists of the 
nucleotides encoding the murine H65 light chain variable 
(V), joining (J) , and human kappa (c t ) regions. The DNA 
sequences of the V and J regions of the H65 Fd and kappa 
5 fragment genes linked to the pelB leader can be obtained 

from GenBank (Los Alamos National Laboratories , Los Alamos, 
New Mexico) under Accession Nos. M90468 and M90467, 
respectively. Several of the gene fusions included a 
gelonin gene linked at the 5 1 end of an H65 Fab fragment 

10 gene while the others included a gelonin gene linked at the 
3' end of an H65 Fab fragment gene. A DNA linker encoding 
a peptide segment of the E. coll shiga-like toxin (SLT) 
(SEQ ID NO: 56) , which contains two cysteine residues 
participating in a disulfide bond and forming a loop that 

15 includes a protease sensitive amino acid sequence) or of 

rabbit muscle aldolase [ (RMA) as in SEQ ID NO: 57, which 
contains several potential cathepsin cleavage sites] was 
inserted between the gelonin gene and the antibody gene in 
the constructs* Alternatively, a direct fusion was made 

20 between a gelonin gene and an H65 Fab fragment gene without 

a peptide linker segment. Table 11 below sets out a 
descriptive name of each gene fusion and indicates the 
expression plasmid containing the gene fusion and the 
section of the application in which each is designated. 

25 Each plasmid also includes the Fab fragment gene (shown in 
parentheses in Table 11) with which each particular gene 
fusion was co-expressed. The inclusion of a cysteine from 
a hinge region (Fd') allows potential formation of either 
monovalent Fab' or bivalent F(ab') 2 forms of the expression 

30 product of the gene fusion. 
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5 


n / i i i \ 

B(lll) 


pING3759 


Gelonm: :RHA: :Fd' (kappa) 




B(iv) 


PING3758 


Gelonin : : RMA : : kappa ( Fd ' ) 




A(i) 


pING4406 


Fd: :SLT: : Gelonin (kappa) 




A(ii) 


pING4407 


kappa: :SLT: : Gelonin (Fd) 




A(iii) 


pING4408 


Fd: :RMA: : Gelonin (kappa) 


10 


A(iv> 


PING4410 


kappa: :RMA: : Gelonin (Fd) 




C(i) 


PING3 334 


Gelonin: : Fd (kappa) 



A* Fusions Of Gelonin At The 

Carboxvl-Terminus Of Antibody Genes 

(i) F3; ?SLTt sG^lQTun (Kappa) 

15 A gelonin gene fusion to the 3* -end of the H65 Fd 

chain with the 23 amino acid SLT linker sequence was 
assembled in a three piece ligation from plasmids pVKl, 
PING3731 (ATCC 68721) and pING4000. Plasmid pVKl contains 
the Fd gene linked in- frame to the SLT linker sequence and 

20 some H65 Fd' and kappa gene modules as in pING3217, shown 

in Better, et al. # Proc. tfat. Acad. Sci. (USA): 457-461 
(1993) , except that the kappa and Fd' regions are reversed. 
Plasmid pING3731 contains the gelonin gene, and pING4000 
contains the H65 kappa and Fd' genes each linked to the 

25 pelB leader sequence under the cpntrol of the araB promoter 
as a dicistronic message. 

Plasmid pVKl was designed to link the 3 ' -end of 
a human IgG Fd constant region in-frame to a protease- 
sensitive segment of the SLT gene bounded by two cysteine 

30 residues which form an intra-chain disulfide bond. The SLT 
gene segment (20 amino acids from SLT bounded by cysteine 
residues, plus three amino acids introduced to facilitate 
cloning) was assembled from two oligonucleotides, SLT 
Linker 1 and SLT Linker 2. 
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SLT Linker 1 (SEQ ID NO: 73) 
5 1 TGTCATCATCATGCATCGCGAGTTGCCAGAATGGCATCT 
GATGAGTTTCCTTCTATGTGCGCAAGTACTC 3 • 
SLT Linker 2 (SEQ ID NO: 74) 
5 5 ■ TCGAGAGTACTTGCGCACATAGAAGGAAACTCATCAGAT 

GCCATTCTGGCAACTCGCGATGCATGATGATGACATGCA 3 ' 
The two oligonucleotides were annealed and ligated into a 
vector (pING3185) containing PstI and Xhol cohesive ends, 
destroying the PstI site and maintaining the Xhol site. 
10 Plasmid pING3135 contained an engineered PstI site at the 
3* -end of the Fd gene, and contained an Xhol site 
downstream of the Fd gene. The product of this ligation, 
pVKl, contained the H65 Fd gene (fused to the pelB leader) 
in frame with the SLT linker segment, and contained two 
15 restriction sites, -Fspl and Seal, at the 3 '-end of the SLT 
linker. 

Plasmid pVKl was digested with Saul and Seal, and 
the 217 bp fragment containing a portion of the Fd constant 
domain and the entire SLT gene segment was purified by 

20 electrophoresis on an agarose gel. pING3731 was digested 

with Smal and Xhol and the 760 bp gelonin gene was 
similarly purified. Plasmid pING4000 was digested with 
Saul and Xhol and the vector segment containing the entire 
kappa gene and a portion of the Fd gene was also purified. 

25 Ligation of these three DNA fragments resulted in pING4406 

containing the Fd: : SLT: : Gelonin (kappa) gene fusion vector. 



(ii) Kma? ;SLT? :Geipnin (Fd) 

A gelonin gene fusion to the 3 v ~end of the H65 
kappa chain with the 25 amino acid SLT linker sequence (20 
30 amino acids from SLT bounded by cysteine residues, plus 5 
amino acids introduced to facilitate cloning) was assembled 
from the DNA segments in pING3731 (ATCC 68721) and 
pING3713. 

Plasmid pING3713 is an Fab expression vector 
35 where the H65 Fd and kappa genes are linked in a 
dicistronic transcription unit containing the SLT linker 
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segment cloned in-frame at the 3* -end of the kappa gene. 
The plasmid was constructed as follows. In a source 
plasmid containing the H65 Fd and Jcappa genes, an Eagl site 
was positioned at the 3* -end of the Jcappa gene by site 
5 directed mutagenesis without altering the encoded amino 

acid sequence. The SLT gene segment from pVKl was 
amplified with primers SLT-SagI-5 1 and Sail for in frame 
linkage to the Eagl site at the 3' -end of the kappa gene. 
SLT-Eag-5' (SEQ ID NO: 75) 
10 5 f TGTTCGGCCGCATGTCATCATCATGCATCG 3* 

Sail (SEQ ID NO: 76) 
5» AGTCATGCCCCGCGC 3 f 
The 140 bp PCR product was digested with Eagl and Xhol, and 
the 75 bp fragment containing the SLT gene segment was 
15 cloned adjacent to the Fd and kappa genes in the source 

plasmid to generate pING3713. 

For construction of gene fusion to gelonin, 
PING3713 was cut with Seal and Xhol, and the vector 
fragment containing the Fd gene and kappa:: SLT fusion was 
20 purif ied. pING3731 was digested with Smal and Xhol and the 

DNA fragment containing the gelonin gene was also purified. 
The product of the ligation of these two fragments, 
pING4407, contains the Fd and kappa :: SLT :: gelonin genes. 



(iii) Fd: :RMA: : Gelonin (kappa! 

25 A gelonin gene fusion to the 3 f -end of the H65 Fd 

chain with the 21 amino acid RMA linker sequence (20 amino 
acids from PMA, plus 1 amino acid introduced to facilitate 
cloning) was assembled in a three piece ligation from 
plasmids pSH4, pING3731 (ATCC 68721) and pING4000. 

30 Plasmid pSH4 contains an Fd gene linked in frame 

to the RMA linker sequence. The RKA gene segment was 
linked to the 3* -end of Fd by overlap extension PCR as 
follows. The 3 '-end (constant region) of the Fd gene was 
amplified by PCR from a source plasmid with the primers 

35 KBA-72 and RMAG-1. Any Fd constant region may be used 
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because constant regions of all human IgG : antibodies are 
identical in this region. 

KBA-72 (SEQ ID NO: 77) 

5 1 TCCCGGCTGTCCTACAGT 3 • 
5 RMAG-1 (SEQ ID NO: 78) 

5 1 TCCAGCCTGTCCAGATGGTGTGTGAGTTTTGTCACAA 3* 
The product of this reaction was mixed with primer RMA-76, 
which annealed to the amplified product of the first 
reaction, and the mixture was amplified with primers KBA-72 
10 and RMAK-2. 

RMA-76 (SEQ ID NO: 79) 

5 • CTAACTCGAGAGTACTGTATGCATGGTTCGAGATGAACA 
AAGATTCTGAGGCTGCAGCTCCAGCCTGTCCAGATGG 3 ' 
RMAK-2 (SEQ ID NO: 80) 
15 5 1 CTAACTCGAGAGTACTGTAT 3 1 

The PCR product contained a portion of the Fd constant 
region linked in- frame to the RMA gene segment. The 
product also contained a Seal restriction site useful for 
in- frame fusion to a protein such as gelonin, and an Xhol 
20 site for subsequent cloning. 

This PCR product was cut with Saul and Xhol and 
ligated adjacent to the remainder of the Fd gene to 
generate pSH4. 

For assembly of the gene fusion vector containing 
25 the Fd: : RMA: : Gelonin, kappa genes, pSH4 was cut with Saul 
and Seal and the Fd: :RMA segment was purified. Plasmid 
PING3731 was cut with Smal and Xhol and the 760 bp DNA 
fragment containing the gelonin gene was purif ied # and 
pING4000 was cut with Saul and Xhol and the vector was 
30 purified. The product of the ligation of these fragments, 
pING4408, contained the Fd: : RMA: : Gelonin and kappa genes • 

(iv) kannas; RMA:; Gelonin fFd) 

A gelonin gene fusion to the 3 f -end of the H65 
kappa chain with the 21 amino acid RMA linker sequence was 
35 assembled in a three piece ligation from plasmids pSH6, 

pING4408 (see the foregoing paragraph) and pING3713. 
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Plasmid pSH6 contains a kappa gene linked in- 
frame to the RMA linker sequence • The RMA gene segment was 
linked to the 3 ' -end of kappa by overlap extension PCR as 
follows. The 3 f -end (constant region) of the kappa gene 
5 was amplified by PCR from a source plasmid with the primers 

KBA-K2 and RMAK-1. 

RMAK-1 (SEQ ID NO: 81) 

5 » TCCAGCCTGTCCAGATGGACACTCTCCCCTGTTGAA 3 1 
KBA-K2 (SEQ ID NO: 82) 

10 5 f GTACAGTGGAAGGTGGAT 3 • 

The product of this reaction was mixed with primer RMA-76 
(SEQ ID NO: 81) , which annealed to the amplified product of 
the first reaction, and the mixture was amplified with 
primers KBA-K2 and RMAK-2 • The PCR product contained a 

15 portion of the kappa constant region linked in-frame to the 

RMA gene segment* The product also contained a Seal 
restriction site useful for in- frame fusion to a protein 
sue as gelonin, and an Xhol site for subsequent cloning. 
This PCR product was cut with SstI and Xhol and ligated 

20 adjacent to the remainder of the kappa gene to generate 

pSH6. 

For assembly of the gene fusion vector containing 
the kappa: : RMA: : Gelonin and Fd genes, pSH6 was cut with 
Hindlll and PstI and the DNA fragment containing the kappa 

25 constant region and a portion of the RMA linker (the PstI 

RMA linker segment contains a PstI site) segment was 
purified. Plasmid pING4408 was cut with PstI and Sail and 
the DNA fragment containing a segment of the RMA linker, 
the gelonin gene and a portion of the tetracycline 

30 resistance gene in the vector segment was purified. 

pING3713 was cut with Sail and ffindlll and the vector was 
purified. The product of the ligation of these three 
fragments, pING4410, contained the kappa: : RMA: : Gelonin and 
Fd genes. 
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B. Fusions Of Gelonin At The 

Amino-Terminus Of Anti body Genes 

(i) Gelonin: :SLT; :Fd' f kappa y 

A gelonin gene fusion to the S'-end of the H65 
5 Fd' chain with a 25 amino acid SLT linker sequence (20 

amino acids from SLT bounded by cysteine residues, plus 
five amino acids introduced to facilitate cloning) was 
assembled in a three piece ligation from plasmids pING3748, 
PING3217, and a PCR fragment encoding the H65 heavy chain 
10 variable region (V H ) gene segment which is the variable 

region of the Fd' gene in pING3217. Plasmid pING3748 
contains the gelonin gene linked in-frame to the SLT linker 
sequence, and pING3217 contains the H65 Fd' and kappa genes 
in a dicistronic transcription unit. 
15 Plasmid pING3825 (see Example 2) was amplified 

with PCR primers qelo3'-Eag and gelo-9 to introduce an Eagl 
restriction site at the 3* -end of the gelonin gene by PCR 
mutagenesis. 

gelo3 f -£agr (SEQ ID NO: 83) 
20 5« CATGCGGCCGATTTAGGATCTTTATCGACGA 3 f 

The PCR product was cut with Sell and Zagl and the 56 bp 
DNA fragment was purified. Plasmid pING37l3 was cut with 
Eagl and Xhol, and the 77 bp DNA fragment containing the 
SLT linker was purified. The 56 bp Sell to Eagl fragment 
25 and the 77 bp Eagl to Xhol fragment were ligated into 

PING3825 which had been digested with Bell and Xhol to 
generate pING3748 which contains the gelonin gene linked 
in- frame to the SLT linker sequence. 

For assembly of the gene fusion vector containing 
30 the Gelonin: : SLT: :Fd' and kappa genes, the H65 V a was 
amplified by PCR from pING3217 with primers H65-G1 and H65- 
G2, and the product was treated with T4 polymerase followed 
by digestion with Ndel. 

H65-G1 (SEQ ID NO: 84) 
35 5' AACATCCAGTTGGTGCAGTCTG 3 1 

H65-G2 (SEQ ID NO: 85) 
5* GAGGAGACGGTGACCGTGGT 3 f 
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The 176 bp fragment containing the 5 '-end of the H65 heavy 
chain V-region was purified. Concurrently, piNG3217 was 
digested with Ndel and Xhol, and the 13 07 bp DNA fragment 
containing a portion of the Fd' gene and all of the kappa 
5 gene was purified. The two fragments were ligated to 

pING3748 which had been digested with Seal and Xhol in a 
three piece ligation yielding pING3754 (ATCC 69102), which 
contains the Gelonin: : SLT: : Fd' and kappa genes. 



(ii) Gelonin: : SLT: : kappa f FdM 

10 A gelonin gene fusion to the 5' -end of the H65 

kappa chain with the 25 amino acid SLT linker sequence was 
assembled in a three piece ligation from plasmids pING3748 
(see the foregoing section), pING4000, and a PCR fragment 
encoding the H65 light chain variable region (V L ) gene 

15 segment. 

For assembly of the gene fusion vector containing 
the Gelonin: : SLT: : kappa and Fd' genes, an H65 V L fragment 
was amplified by PCR from pING3217 with primers H65-K1 and 
JKl-ifxndlll, and the product was treated with T4 polymerase 
20 followed by digestion with Hindlll. 

H65-K1 (SEQ ID NO: 86) 

5* GACATCAAGATGACCCAGT 3* 

JKl-Hindlll (SEQ ID NO: 87) 

5 f GTTTGATTTCAAGCTTGGTGC 3 f 
25 The 306 bp fragment containing the light chain V-region was 
purified. Concurrently, pING4000 was digested with Hxndlll 
and Xhol, and the 1179 bp DNA fragment containing the kappa 
constant region and all of the Fd' gene was purified. The 
two fragments were ligated to pING3748 which had been 
30 digested with Seal and Xhol in a three piece ligation 

yielding pING3757, which contains the Gelonin: : SLT: : kappa 
and Fd genes. 
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(iii) Gelonin: : RMA: ! Fd* f*ap r ») 

A gelonin gene fusion to the 5* -end of the H65 
Fd' chain with the 24 amino acid RMA linker sequence (20 
amino acids from RMA, plus 4 amino acids introduced to 
5 facilitate cloning) was assembled in a three piece ligation 

from plasmids pING3755, pING3217 and a PCR fragment 
encoding the H65 V H gene segment. Plasmid pING3755 contains 
the gelonin gene linked in-frame to the RMA linker 
sequence, and pING3217 contains the H65 Fd' and kappa genes 

10 in a dicistronic transcription unit. 

Plasmid pIHG3755 was assembled to contain the 
gelonin gene linked to the RMA linker gene segment. The 
RMA linker gene segment was amplified by PCR from pSH4 with 
primers RMA-JffagI and HJtfDIII-2. 

15 RMA-SagI (SEQ ID NO: 88) 

5 » ACTTCGGCCGCACCATCTGGACAGGCTGGAG 3 1 
HINDZT1-2 (SEQ ID NO: 44) 
5 t CGTTAGCAATTTAACTGTGAT 3' 
The 198 bp PCR product was cut with Eagl and Jfindlll, and 

20 the resulting 153 bp DNA fragment was purified. This RMA 

gene segment was cloned adjacent to gelonin using an PstI 
to EagI fragment from pING3748 and the PstI to HlndZXX 
vector fragment from pING3825. The product of this three 
piece ligation was pING3755. 

25 For assembly of the gene fusion vector containing 

the Gelonin: : RMA: : Fd' , kappa genes, the H65 V a was amplified 
by PCR from pING3217 with primers H65-G1 (SEQ ID NO: 84) 
and H65-G2 (SEQ ID NO: 85) , and the product was treated 
with T4 polymerase followed by digestion with tfdel. The 

30 186 bp fragment containing the 5 9 -end of the heavy chain V- 

region was purified. Concurrently, pING3217 was digested 
with NdeX and Xhol, and the 1307 bp DNA fragment containing 
a portion of the Fd' gene and all of the kappa gene was 
purified. These two fragments were ligated to pING3755 

35 which had been digested with Seal and XhoX in a three piece 
ligation yielding pING3759 (ATCC 69104), which contains the 
Gelonin: : RMA: :Fd' and kappa genes. 



WO 94/26910 



PCT/US94/05348 



-98- 

(iv) Gelonin: : RMA* : kap pa 

A gelonin gene fusion to the 5 1 -end of the H65 
kappa chain with the 24 amino acid RMA linker sequence was 
assembled in a three piece ligation from plasmids pING3755, 
5 pING4000, and a PCR fragment encoding the H65 V L gene 

segment. 

For assembly of the gene fusion vector containing 
the Gelonin: : RMA: : kappa and Fd' genes, an H65 V L segment was 
amplified by PCR from pING3217 with primers H65K-1 (SEQ ID 

10 NO: 86) and JKl-#indIII, and the product was treated with 
T4 polymerase followed by digestion with Hindlll. The 306 
bp fragment containing the 5 f -end of the light chain V- 
region was purified. Concurrently, pING4000 was digested 
with Hindlll and Xhol, and the 1179 bp DNA fragment 

15 containing the kappa constant region and all of the Fd' 
gene was purified. These two fragments were ligated to 
PING3755 which had been digested with Seal and Xhol in a 
three piece ligation yielding pING3758 (ATCC 69103), which 
contains the Gelonin: : RMA: : kappa and Fd' genes* 

20 C. Direct Fusions Of Gelonin At The 

Amino Terminus Of Antibody Genes 

(i) gglpftih: : yd' fy^ppq) 

A direct gelonin gene fusion was constructed from 
PING3754. pING3754 was digested with Bglll and Xhol and 

25 the vector segment was purified. Concurrently, pING3754 
was digested with Eagl, treated with T4 polymerase, cut 
with Bgrill, and the gelonin gene segment was purified. 
pING3754 was also cut with Fspl and Xhol, and the Fd and 
kappa gene segment was purified. These fragments were 

30 assembled in a three-piece ligation to generate pING3334, 
which contains a direct gene fusion of gelonin to Fd' in 
association with a kappa gene. 
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Bxample IS 

Preparation of he3 Fab And Gelonin he3Fah Immunofuginnc 

The sections below detail the construction of 
human-engineering he3Fab protein and immunofusions of 
5 gelonin to he3 Fd and kappa chains. 

A. he3-Fab Expression Plasmids 

The he3 heavy chain V-region was PCR-amplif ied 
from plasmid pING4621 (pING4621 is fully described above in 
Example 5 above), with primers H65-G3, 

10 GAGATCCAGTTGGTGCAGTCTG (SEQ ID NO: 116) and H65G2 (SEQ ID 

NO: 85) . Amplification was carried at using vent 
polymerase (New England Biolabs) for 25 cycles, including 
a 94 # C denaturation for 1 minute, annealing at 50*C for 2 
minutes, and polymerization for 3 minutes at 72 # C. The PCR 

15 product was treated with polynucleotide kinase and digested 

with SstEII and the V-region DNA was purified. The 
purified DNA fragment was then ligated into pIClOO, which 
had been digested with SstI, treated with T4 polymerase, 
and cut with SstEII. The resulting fragment was then 

20 ligated with the BstEII fragment from pING3218 (containing 

Fab' genes) to make pING4623 which contained the he3 Fd 
gene linked to the peiB leader sequence. 

The he3 kappa V- region was next assembled as 
described above in Example 5 and in co-owned, co-pending 

25 U.S. Patent Application Serial No. 07/808,464, incorporated 

by reference herein, using six oligonucleotide primers, 

$H65k-l, AGT CGT CGA CAC GAT GGA GAT GAG GAC CCC 
TGC TCA GTT TCT TGG CAT CCT CCT ACT CTG GTT TCC AGG TAT CAA 
ATG TGA CAT CCA GAT GAC TCA GT (SEQ ID NO: 117) ; 

30 HUH-K6, TCA CTT GCC GGG CGA ATC AGG ACA TTA ATA 

OCT ATT TAA GCT GGT TCC AGC AGA AAC CAG GGA AAG CTC CTA AGA 
CCC T (SEQ ID NO: 118) ; 

HUH-K7, TGA CTC GCC CGG CAA GTG ATA GTG ACT CTG 
TCT CCT ACA GAT GCA GAC AGG GAA GAT GGA GAC TGA GTC ATC TGG 

35 ATG TC (SEQ ID NO: 119) ; 
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HUH-K8, GAT CCA CTG CCA CTG AAC CTT GAT GGG ACC 
CCA GAT TCC AAT CTG TTT GCA CGA TAG ATC AGG GTC TTA GGA GCT 
TTC C (SEQ ID NO: 120); 

HUH-K4, GGT TCA GTG GCA GTG GAT CTG GGA CAG ATT 
5 ATA CTC TCA CCA TCA GCA GCC TGC AAT ATG AAG ATT TTG GAA TTT 

ATT ATT G (SEQ ID NO: 121) ; and 

HUH-K5 , GTT TGA TTT CAA GCT TGG TGC CTC CAC CGA 
ACG TCC ACG GAG ACT CAT CAT ACT GTT GAC AAT AAT AAA TTC CAA 
AAT CTT C (SEQ ID NO: 122) 
10 and amplified with primers HUK-7 (SEQ ID NO: 92) and JKl- 

Hindlll (SEQ ID NO: 87). 

The resulting PCR product was treated with T4 
polymerase, digested with Hindlll, and purified. The 
purified fragment was then cloned into pIClOO, which had 
15 first been cut with SstI, treated with T4 polymerase, and 

digested with Xhol, along with the 353 bp Hindlll-Xhol 
fragment encoding the kappa constant region from pING3217. 
The resulting plasmid was pING4627 which contains the he3 
kappa sequence linked in frame to the pelB leader. 
20 Plasmid pING4628, containing the pelB-linked he3 

kappa and Fd genes under transcriptional control of the 
araB promoter, was assembled from pING4623 and pING4627 as 
follows. 

An expression vector for unrelated kappa and Fd 
25 genes, pNRX-2, was first cut with Saul and EcoRl, leaving 

a vector fragment which contains all the features relevant 
to plasmid replication, a tetracycline resistance marker, 
araB transcriptional control, and the 3' end of the Fd 
constant region* [Plasmid pNRX-2 comprises an j?coRI to 
30 Xhol DNA segment from pING 3104 (described in WO 90/02569, 
incorporated by reference herein) . That segment contains 
the replication, resistance and transcription control 
features of pING3104 and is joined to an Xhol to Saul DNA 
segment from pING1444 (described in WO 89/00999, 
35 incorporated by reference herein) which contains the 3' end 
of an Fd constant region.] Next pING4623 was cut with 
PstX, treated with T4 polymerase, digested with Saul and 
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the peIB::Fd gene segment was then isolated • Plasmid 
pING4627 was cut with Xhol, treated with T4 polymerase, cut 
with EcoRl and ligated to the pelB::Fd gene segment and the 
pNRX-2 vector fragment to generate the he3-Fab expression 
5 vector pING4628. That plasmid contains two Xhol sites, one 

located between the kappa and Fd genes, and another 4 bp 
downstream of the termination codon for the Fd gene, 

A vector, pING4633, which lacks the Xhol site 
between the kappa and Fd genes was constructed. To 

10 assemble pING4633, pING4623 was cut with EcoRX, treated 

with T4 polymerase, digested with Saul, The pelB:: kappa 
gene segment was then isolated and purified. The pNRX-2 
vector fragment and the peIB::Fd gene segment were then 
ligated to the purified peIB::kappa gene segment to form 

15 pING4633, 

Both pING4633 and pING4628 are bacterial 
expression vectors for he3-Fab and each comprises the he3 
Fd and Kappa genes which are expressed as a dicistronic 
message upon induction of the host cell with L-arabinose . 
20 Moreover, pING4628 contains two Xhol restriction sites, one 
located 4bp past the Fd termination codon and one in the 
intergenic region between the 3' end of the Kappa gene and 
the 5' end of the Fd gene. Plasmid pING4633 lacks the Xhol 
site in the intergenic region. 



25 b. purification Qf fre3r?b 

Plasmids pING4628 and pING463 3 were transformed 
into E. coll £104. Bacterial cultures were induced with 
arabinose and cell-free supernatant comprising the he 3 Fab 
was concentrated and filtered into 20 mm HEPES, pH 6.8. 

30 The sample was then loaded onto a CM Spheradex column (2.5 
x 3 cm), equilibrated in 20 mH HEPEs, 1.5 mM NaCl, pH 6.8. 
The column was washed with the same buffer and eluted with 
20 mm HEPES, 27 mM NaCl, pH 6.8. The eluate was split into 
2 aliguots and each was loaded onto and eluted from a 

35 protein G (Bioprocessing) column (2.5 x 2.5 cm) separately. 

The protein G column was equilibrated in 20 mM HEPES, 75 MM 
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NaCl, pH 6.3 and the sample was eluted with 100 mM glycine, 
100 mM NaCI, pH 3.0. The two eluates were combined and 
diluted two times with 20 mM HEPES, 3 M ammonium sulfate, 
pH 6.8. The diluted eluates were loaded onto phenyl 
5 sepharose high substitution Fast Flow (Pharmacia) column 

(2.5 x. 3.3 cm) , equilibrated n 20 dm HEPES, 1.5 M ammonium 
sulfate, pH 6.8. The column was then eluted with 2 0 mM 
HEPES, 0.6 M ammonium sulfate, pH 6.8. 

C. Gelonin: zRMA: :he3Kapoa, he3Fd Fusions 

10 Other genetic constructs were assembled which 

included a natural sequence gelonin gene fused to an he3- 
Fab via a linker. 

A fusion comprising Gelonin: :RMA: : he 3 Kappa, Fd 
was assembled from DNA from plasmids pING3755, pING4 633, 

15 and pING4628. Both pING4633 and pING4628 were assembled in 

a series of steps whereby the he 3 heavy and light v-regions 
were individually linked in-frame to the pelB leader. The 
heavy and light V-regions were then placed together in a 
dicistronic expression vector under the control of the araB 

20 promoter in a bacterial expression vector. 

Assembly of the Gelonin: :RMA: :he3 Kappa, he3Fd 
fusions was accomplished by constructing three DNA 
fragments from plasmids pING3755, pING4633, and pING4628. 
The first such fragment was made by digesting pING3755 with 

25 Seal and Xhol which excises the 4bp between those sites. 

The resulting vector fragment was purified. The second 
fragment for use in constructing the above fusions was 
obtained from plasaid pING4633, which was cut with Asel 
(which cuts in V L ) and Xhol and the resulting 1404 bp 

30 fragment, containing the 3' end segment of the Kappa and Fd 
genes, was purified. The third fragment, comprising the 5' 
end of the Kappa variable region coding sequence, was 
prepared from the PCR amplified V L gene contained in 
PING4628 using the oligonucleotide primers, Huk-7 and jkl- 

35 tfindlll. The resulting 322 bp PCR-amplif ied V L fragment was 
treated with T4 polymerase, digested with Asel, and the 86 
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bp fragments containing the 5' end of v L was purified. The 
three fragment produced above were ligated together to form 
PING3764. The DNA sequence of the PCR amplified V-region 
was verified by direct DNA sequencing of pING3764* 



D- Gelonin: : SLT: :he 3Kappa. ha3Fd Fi^ ?n 

A Gelonin: :SLT: :he3Kappa, he3Fd fusion was 
constructed by ligating the pING4633 and plNG4628 fragments 
described in section A immediately above with a fragment 
produced from pING3748 which contains Gelonin: : SLT. The 
pING3748 fragment was produced using Seal and Xhol as 
described immediately above for pING3755. The resulting 
vector was designated pING3763. 



E. Construction of Expression Vector Containing 
Gelonin; ;SLT: :he 3Fd, ha3kappa Fusions ^ 

15 An expression vector containing the 

Gelonin: :SLT: :he3Fd, he3kappa fusion was constructed in two 
steps form DNA segments from plasmids pING3825, plNG4628, 
pING4639, pING3217 [described in Better, et al., Proc. 
Natl. Acad. Sci. (USA), 90:457-461 (1993), incorporated by 

20 reference herein], and pING4627. pING3825 was digested 

with Ncol and Xhol, generating a 654 bp fragment containing 
the 3' end of the gelonin gene and a fragment containing 
the 5' end of the gelonin gene which were purified ♦ Next, 
PING4639 was digested with tfcol and Ndel and the 903 bp 

25 fragment containing the 3' end of the Gelonin gene, the SLT 

linker, and the 5' end of V a which resulted was purified. 
Finally, pING4628 was cut with Ndml and Xhol, resulting in 
a 523 bp fragment containing the 3' end of the Fd gene 
which was purified. The three fragments were then ligated 

30 to form plasmid pING3765 which contains a gene encoding a 
gelonin: : SLT: :he3Fd fusion. 

Three vector fragments were used to assemble the 
final expression vector (containing the gelonin: : SLT: : he 3 Fd 
and he3 kappa segments) . Plasmid pING3765 was digested 

35 with Xhol, treated with T4 polymerase, cut with Miel (which 
releases a 265 bp fragment encoding the tetracycline 



WO 94/26910 ~ ~ PCT/US94/0O48 



-104- 

resistent marker) , and the resulting vector fragment was 
purified. Plasmid plNG4 627, which contains the he3Kappa 
gene linked in-frame to the pelB leader was used for the 
construction of pING4628. Plasmid pING4627 was cut with 
5 PstI, treated with T4 polymerase, and further digested with 

Sstl. The resulting 726 bp fragment, containing the Kappa 
gene (except 40 bp at the 3' end) was purified. Plasmid 
PING3217 was then cut with Sstl and Nhel, resulting in a 
318 bp fragment containing the 3' end of the Kappa gene and 
10 downstream portion, including a portion of the tetracycline 

resistance gene, which was purified. Ligation of the 
foregoing three fragments produced the final expression 
vector, pING3767. 



F. Construction Of Expression Vector 
15 Containing Gelonin: :RMA; :he3Fd Fusions 

Gelonin: :RMA:he3Fd, he3Kappa fusion expression 

vectors was constructed in two steps from plasmids 

pING3825, pING462S, pING3217, and pING4627. The cloning 

scheme used was identical to that used to generate plNG3767 

20 except that pING4638 was substituted for pING4639. Plasmid 

pING4638 differs from pING4639 as described below in 

Example 16 ♦ The intermediate vector encoding the 

Gelonin: :RMA: :Fd fusion was designated pING3766 and the 

final expression vector was designated pING3768. 



25 Example 16 

The natural sequence gelonin gene was also fused 
to a single chain form of the human engineered he 3 H65 
variable region. The gelonin gene was positioned at either 
30 the N-terminus or the C-terminus of the fusion gene and the 

SLT or RMA linker peptide was positioned between the 
gelonin and antibody domains to allow intracellular 
processing of the fusion protein with subsequent cytosolic 
release of gelonin. 
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A. Construction of Gel: :RMA: :SCA(V L -v H ) , Gel : : SLT: : SCA 

fV,-V^ . Gel; :RMA: :SCArv r y T ) , and Gel: :SLT: :SCAfV r v. | 

A single chain antibody (SCA) form of the he3 H65 

variable domain was assembled from previously constructed 

5 genes. This SCA segment consisted of the entire v and j 

region of the one chain (heavy or light) linked to the 

entire V and J segment of the other chain (heavy or light) 

via a 15 amino acid flexible peptide: [(Gly)« Ser] 3 . This 

peptide is identical to that described in Huston et al., 

10 Proc. Natl. Acad. Sci. USA, 85:5879-5883 (1988); 

Glockshuber et al., Biochemistry, 29:1362-1367 (1990); and 
Cheadle et al., Molecular Immunol., 29:21-30 (1992). The 
SCA was assembled in two orientations: v- 
Jk.pp«: s [ (Gly) 4 Ser] 3 : :V-J Ga(m . and V-J G _: : [ (Gly) 4 Ser] 3 : :V-J kBppm . 

15 Each SCA segment was assembled and subsequently fused to 

gelonin. 

For assembly of the SCA segment V- 
^k.pp.: = C (Gly) 4 Ser] 3 : tV-J^^ primers HUK-7 and SCFV-1 were 
used to amplify a 352 bp DNA fragment containing the he3 
20 V/J kappa sequences from pING4627 by PCR in a reaction 

containing 10 mM KC1, 20 mM TRIS pH 8.8, 10 mM (NH 4 ) 2 S0 2 , 2mM 
MgSO w 0.1% Triton X-100., 100 ng/ml BSA, 200 uM of each 
dNTF, and 2 Units of Vent polymerase (New England Biolabs, 
Beverley, Massachusetts) in a total volume of 100 m1* 
25 SCFV-1 (SEQ ID NO: 91) 

5 9 CGGACCCACCTCCACCAGATCCACCGC 

CACCTTTCATCTCAAGCTTGGTGC 3 f 

HUK-7 (SEQ ID NO: 92) 

5* GACATCCAGATGACTCAGT 3 9 
30 Concurrently, primers SCFV-2 and SCFV-3 were used to 
amplify a he3 heavy chain V/J gamma segment from pING4623, 
generating a 400 bp fragment. 
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SCFV-2 (SEQ ID NO: 93) 

5 ' GGTGGAGGTGGGTCCGGAGGTGGAGGATCTGA 
GATCCAGTTGGTGCAGT 3 • 
SCFV-3 (SEQ ID NO: 94) 

5 1 TGTACTCGAGCCCATCATGAGGAGACGGTGACCGT 3 • 
The products from these reactions were mixed and amplified 
with the outside primers HUK-7 and SCFV-3. The product of 
this reaction was treated with T4 polymerase and then cut 
with Xhol. The resulting 728 bp fragment was then purified 
by electrophoresis on an agarose gel. This fragment was 
ligated into the vectors pING3755 and pING3748 (see Example 
10), each digested with Seal and Xhol. The resulting 
vectors pING4637 and pING4412 contain the Gelonin: :RMA: : sca 
v -^k«p P .: : [ (Gly) 4 Ser] 3 : :V-J Gm-€ and Gelonin: :SLT: :SCA 
15 V-J kappa : : [ (Gly) 4 Ser] 3 : :V-J G _ fusion genes, respectively. 

The 728 bp fragment was also ligated into pIClOO previously 
digested with SstI, treated with T4 polymerase and digested 
with Xhol, to generate pING4635. This plasmid contains the 
pelB leader sequence linked in- frame to the 
20 V-j Kapp , : : [ (Gly),Ser] 3 : :V-JJ i-BM gene. The pelB::SCA gene in 

PING4635 was excised as an EcoRI-Xhol restriction fragment 
and cloned into the bacterial expression vector to generate 
PING4640. 

Similarly, the SCA V-J 5—l : [ (Gly).Ser] 3 : :V-J kapp . 

25 was assembled by amplification of pING4627 with primers 

SCFV-5 and SCFV-6 generating a 367 bp fragment containing 

he3 V/J kappa sequences, 

SCFV-5 (SEQ ID NO: 95) 

5 • GGTGGAGGTGGGTCCGGAGGTGGAGGATCT 
30 GACATCCAGATGACTCAGT 3» 

SCFV-6 (SEQ ID NO: 96) 

5 f TGTACTCGAGCCCATCATTTCATCTCAAGCTTGGTGC 3' 
and pING4623 with primers H65-G3 and SCFV-4 generating a 
385 bp fragment containing he3 gamma V/J sequences by PCR 
35 with Vent polymerase. 

H65-G3 (SEQ ID NO: 97) 

5 • GAGATCCAGTTGGTGCAGTCTG 3' 
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SCFV-4 (SEQ ID NO: 98) 

5 ' CGGACCCACCTCCACCAGATCC 

ACCGCCACCTGAGGAGACGGTGACCGT 3 • 
The products from these reactions were mixed and amplified 
5 with H65-G3 and SCFV-6. The 737 bp product was treated 

with T4 polymerase and cut with Xhol. Ligation into 
PING3755 and pING3748 (digested with Seal and Xhol) 
resulted in assembly of the Gelonin : : RMA: : SCA v- 
Jg«».: s [ (Gly) A Ser] 3 : :V-J kappa gene fusion in pING4638 and 
10 Gelonin: :SLT: : SCA V-J GaoM : : [ (Gly) 4 Ser] 3 : :V-J kappa gene fusion 

in pING463 9, respectively. 

The vectors p!NG4637, pING4412, pING4638 and 
PING4639 were each transformed into E. coli strain E104 and 
induced with arabinose. Protein products of the predicted 
15 molecular weight were identified by Western blot with 

gelonin-specif ic antibodies. 

B. Construction of 

SCAfVT-V.) : ;SLT; : Gelonin Vectors 

The expression vector containing SCA(V L - 

20 V H ) :: SLT: : Gelonin fusions was assembled using restriction 
fragments from previously-constructed plasmids plNG4640 
(containing SCA(V L -V H )) plNG4407 (containing 
Kappa : : SLT : : Gelonin , Fd) , and pING3197. Plasmid pING4640 
was first cut with BspHI, filled in with T4 polymerase in 

25 the presence of only dCTP, treated with mung bean nuclease 

(MBN) to remove the overhang and to generate a blunt end, 
and cut with £coRX. The resulting 849 bp fragment was 
purified. The SLT-containing fragment from pING4407 was 
excised by cutting with EagI, blunted with T4 polymerase , 

30 m cut with Xhol, and the approximately 850 bp fragment which 
resulted was purified. The two fragments were ligated 
together into pING3197, which had been treated with EcoRl 
and Xhol to generate pING4642. The DNA sequence at the 
BspHI-T4-MBN/£agrI junction revealed that two of the 

35 expected codons were missing but that the fusion protein 
was in frame. 
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C. Construction of 

SCAfV^-V^ : :SLT: :Geloni n Vectors 

The expression vector containing the SCA(V a - 
V L ) : :SLT: : Gelonin fusions was assembled using DNA from 
5 plasmids pING4636, (the E. coll expression vector for 

SCA(V H -V L )) and pING4407. Plasmid pING4636 was cut with 
BstEII and Xhol and the resulting vector fragment was 
purified. Concurrently, pINg463 6 was used as a template 
for PCR with primers SCFV-7, 

10 5 ' TGATGCGGCCGACATCTCAAGCTTGGTGC (SEQ ID NO: 112) and H65- 

G13, TGATGCGGCCGACATCTCAAGCTTGGTGC3 ' (SEQ ID NO: 113). The 
amplified product was digested with EagI and BstEII and the 
resulting approximately 380 bp fragment was purified, 
Plasmid pING4407 was then cut with EagI and Xhol, resulting 

15 in an approximately 850 bp fragment, which was purified. 

The three above fragments were ligated together to produce 
PING4643. 

D. Construction of 

SCA(Vt-Vb) : :RMA: : Gelonin Vectors 

20 Expression vectors containing SCA(V L - 

V H ) : :RMA: rGelonin fusions were assembled using DNA from 
PING4640, pING4408 [Example 14A(iii)], and pING3825 
(Example 2C) . Plasmid pING4640 was cut with Sail and 
BstEII and the resulting approximately 700 bp vector 

25 fragment (containing the tetracycline resistance matter) 

was purified. Next, pING3825 was digested with Ncol and 
Sail, resulting in an approximately 1344 bp fragment 
containing the 3' end of the gelonin gene and adjacent 
vector sequences. That fragment was purified. Plasmid 

3 0 pING4408 was then PCR amplified with oligonucleotide 
primers , RMA-G3 5 ' TCTAGGTCACCGTCTCCTCACCATCTGGACAGGCTGGA3 ' 
(SEQ ID NO: 114), and gelo-10. The resulting PCR product 
was cut with BstEII and Ncol to generate an approximately 
180 bp fragment containing the 3' end of V a , RMA, and the 5' 

35 end of the Gelonin gene which was purified* The above 
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three fragments were ligated to generate the final 
expression vector, pING4644. 

E. Construction of 

SCAfV,-V L ) : : RMA: :Gelonin Vectors 

5 Expression vectors containing SCA(V H - 

V L ) : :RMA: :Gelonin were constructed using DNA from plNG 4 636, 
pING4410, and pING3825. Plasmid pING4636 was digested with 
Sail and Hlndlll and the resulting vector fragment was 
purified* Next, pING3825 was cut with Ncol and Sail and 

10 the 1344 bp fragment which resulted contained the 3' end of 

the gelonin gene and adjacent vector sequences encoding 
tetracycline resistance was purified. Finally , pING4410 
was PGR amplified with primers RMA-G4 , 
5 ' TTCGAAGCTTGAGATGAAACCATCTGGACAGGCTGGA3 ' (SEQ ID NO: 115) 

15 and gelo-10. The PCR product was cut with Hijjdlll and 

tfcol, resulting in a 180 bp fragment containing the 3 'end 
of V L , RMA, and the 5' end of Gelonin and was purified. The 
three above fragments were ligated together to generate the 
final expression vector, pING4645, 

20 Gelonin: :SCA fusions without a cleavable linker 

may be constructed by deletion of the SLT linker in 
pING4412 using the restriction enzymes £agl and Fspl. 
Digestion at these sites and religation of the plasmid 
results in an in-frame deletion of the SLT sequence. 

25 sample 

Multivalent Immunofusions 

Multivalent forms of the immunofusions may be 
constructed. 

A. Construction of (Gel: : RMA: : kappa, Fd')2 
30 and fGel:: RMA::Fd'. kappa). Expression Vectors 

Bacterial Fab express ionn vectors can result in 

the production of F(ab')2 if the two cysteine residues from 

the human IgGl hinge region are included into the carboxyl- 

terminus of the Fd protein [Better et al., Proc. Natl. 

35 Acad. Sci. USA, 90:457-461 (1993)], To express a gelonin 
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fusion protein that could form a bi-valent structure like 
an F(ab') 2 , the he 3 Fd' (2C) hinge region (Better et al . , 
supra) was cloned into the expression vector pING37 64 
(Example 15C) encoding the fusion protein Gel: :RMA: : kappa, 
5 Fd. 

Plasmid pING3764 was cut with Xhol and Ssu3 6I and 
the approximately 7500 bp fragment containing the 
immunofusion gene and vector sequences was purified. 
Plasmid pING4629, which encodes he3 F(ab') 2/ was also cut 

10 with Bsu36I and Xhol, and the approximately 200 bp DNA 

fragment containing the he3 Fd' (2C) gene segment was 
purified. These two DNA fragments were ligated to generate 
PING3775 encoding (Gel: :RMA: : kappa, Fd') 2 . An expression 
vector encoding the fusion protein (Gel: :RMA: :Fd' , kappa) 2 

15 was also made. 



B. Construction of Vectors Containing Both Gel : :RMA: : Fd 
and Gel: :RMA: :K Fusions 

In order to construct a plasmid comprising 

Gel::RMA::Fd and Gel: :RMA: :k fusions, plasmid pING3764 

20 [described above in Example 15(b)] was digested with Bsgl 

and Saul and a 5.7 kb vector fragment containing plasmid 
replication functions, Gel: :RMA: :k, and the 3' end of Fd 
was isolated and purified. Plasmid pING376S [described 
above in Example 15(E)] was digested with Saul and 

25 partially digested with PstI and a 1.5 kb fragment 
containing Gel : : RMA : : Fd was purified. Finally, pING4000 
[described above in Example 14] was digested with Bsgl and 
PstI, generating a 350 bp fragment containing the 3' end of 
the kappa gene. That fragment was purified and the 5.7 kb, 

30 1.5 kb, and 350 kb fragments described above were ligated 

together to form pING3770, containing the gelonin: :RMA: :k 
and gelonin : RMA : : Fd fusions. 
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C. Construction of Vectors Containing Both Gel: : SLT: :Fd 
and Gel::SLT::k Fusions 

Plasmid pING3772 contains the above-entitled 

fusions and was constructed as follows. Flasinid plNG3 7 63 

5 [described above in Example 15(D)] was digested with Bsgl 

and Saul and a 5.7 kb fragment containing the replication 

functions, the 5' end of Gel::SLT::k and the 3' end of Fd 

was generated and purified. Next, plasmid pING3 7 67 

[described in Example 15(D) above] was digested with Saul 

10 and PstI, generating a 1.5 kb fragment containing the 5' 

end of the gel: : SLT: : Fd fusion. That fragment was purified 
and pING4000 [described in Example 14 above] was digested 
with Bsgl and PstI. The resulting 350 bp fragment was 
purified and the above-described 5.7 kb, 1.5 kb, and 350 bp 

15 fragments were ligated to form pING3772. 



D. Expression of Multivalent Fusions 

Both pING3770 and pING3772 were transformed into 

E. coli (E104) cells by techniques known to those of 
ordinary skill in the art and induced with arabinose. 

20 Concentrated supernatants from the transformed cell 

cultures were analyzed by Western blot analysis with rabbit 
anti-gelonin antiserum. Transf ormants from both plasmids 
generated a reactive band on the gel at the size expected 
for a Fab molecule carrying two gelonins (approximately 105 

25 kD) . These results are consistent with the production of 
fusion proteins comprising monovalent Fab, with both Fd and 
kappa chains separately fused to gelonin. 

E. coli strains containing plasmids pING3775, 
pING3770 and pING3772 were grown in ferment ers and the 

30 fusion protein products were purified. The 

(Gel : : RMA : : kappa , Fd ' ) 2 expressed from pING3775 was purified 
as described in Better et al., supra. 
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Construction of Expression Vectors 
Encoding Imnunofusions Without Linkers 

Expression vectors encoding direct fusions of 
5 gelonin and dicistronic he3 Fab protein or single chain 

antibody were constructed as follows, 
A. VnV T : ; Gel 

Plasmid pING4642 (Example 16B) which encodes the 
V L V B : :SLT: :Gel fusion protein was cut with Fspl and tfcol, 

10 and the approximately 100 bp DNA fragment containing the 

5 '-end of the gelonin gene was purified. Plasmid pING4643 
(Example 16C) , which encodes the V H V L : : SLT: :Gel fusion 
protein, was cut with EagZ, treated with T4 polymerase and 
cut with Pstl. The approximately 850 bp DNA fragment 

15 encoding the V H V L gene segment was purified. The DNA 
fragments from pING4642 and pING4643 were ligated into the 
vector DNA fragment from pING4644 (Example l€D) 

that had been cut with Pstl and tfcol to generate pING378l, 
20 which encodes the V H V L : :Gel direct gene fusion. 



B. V T y H ?t(?el 

Plasmid pING4640 which encodes the he3 SCA gene 
V L V H was cut with BspHI, treated with T4 polymerase in the 
presence of the nucleotide dCTP only, treated with mung 
25 bean nuclease to remove the remaining 5' overhang, and then 
cut with EcoKI. The approximately 800 bp DNA fragment 
containing the he3 V L V H gene was then purified on an agarose 
gel. 

Plasmid pING3781 which encodes the direct fusion 
30 V H V L ::Gel was digested with EagI, treated with T4 
polymerase, and then digested with Xhol. The approximately 
800 bp DNA fragment encoding the gelonin gene was then 
purified on an agarose gel. The two DNA fragments from 
PING4640 and pING3781 were ligated into the vector DNA from 
35 pING3767 which had been digested EcoRI and Xhol and 
purified on an agaraose gel. The resultant plasmid, 
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pING3348, encoded the v L V a ::Gel fusion protein. The DNA 
sequence at the fusion junction was verified by direct DNA 
sequencing. 

C Gel; :V H V T 

5 The plasmid pING3755 [Example 14B(iii)], which 

contains the gelonin gene with an engineered Eagl site at 
its 3' -end, was cut with Eagl, treated with T4 polymerase, 
and digested with tfcol. The approximately 650 bp DNA 
fragment containing the 3* -end of the gelonin gene was 

10 purified on an agarose gel. The plasmid pING4 639 (Example 

16A) encoding the fusion Gel: : SLT: : V H V L was cut with Xhol 
and then partially digested with Fspl. The approximately 
730 bp DNA fragment containing all of the he3 V H v L gene was 
then purified in an agarose gel (a single Fspl restriction 

15 site occurs in the V B gene segment, and the purified he3 V a V L 

gene was separated from the incomplete gene segment which 
was approximately 660 bp) * The two DNA fragments from 
PING3755 and pING4639 were ligated into the vector pING3825 
that had been digested with J/col plus Xhol and purified on 

20 an agarose gel. The plasmid pING3350 was generated which 

encoded the Gel::V H V L fusion protein* The DNA sequence at 
the fusion junction was verified by direct DNA sequencing. 

D. Gel::V,V- 

Plasmid pING3336 which encodes the he3 V L V H single 

25 chain antibody gene was cut with Sstl and Asel, and the 
approximately 5500 bp DNA fragment containing the 3 '-end of 
V L V H and downstream vector sequences was purified* 
(pING3336 is identical to pING4640 except that the V L V H gene 
encodes six histidine residues in frame at the carboxyl- 

30 terminus) • Plasmid PZNG4627 (Example 15A) served as a 
substrate for PCR amplification of the V H gene segment. 
Plasmid pING4627 was amplified with the two oligonucleotide 
primers HUK-7 (SEQ ID NO: 92) and JKl-Hindlll (SEQ ID NO: 
87) , the resultant product was treated with T4 polymerase 

35 and cut with Asel, and the 86 bp DNA fragment containing 
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the 5 '-end of the V L was purified. The DNa fragments from 
pING33 3 6 and pING4 627 were ligated to the approximately 
2 3 50 bp DNA fragment of pING3755 generated by digestion 
with Eagl, treatment with T4 polymerase and subsequent 
digestion with Sstl. The resultant vector containing the 
Gel::V L V H gene fusion was named pING4652. The DNA sequence 
of pING4652 was verified at ligation juctions. 

E. Gelt; kappa, Fd 

The direct gene fusion which encodes Gel::kappa, 
Fd was also assembled from DNA segments from three 
plasmids. Plasmid pING3764 (Example 15C) was digested with 
Hindlll and Xhol, and the approximately 1200 bp DNA 
fragment encoding the 3 1 -end of the kappa gene and the Fd 
gene was purified. Plasmid pING4652, which encodes a 
direct gene fusion of gelonin to the he3 SCA gene V L v H , was 
cut with Sglll and Hindlll, and the approximately 850 bp 
DNA fragment encoding the 3 1 -end of the gelonin gene and 
the V L region of kappa was purified. The DNA fragments from 
PING3764 and pING4652 were ligated into the vector fragment 
from pING3825 (Example 2C) that had been digested with 
BglU and Xhol to generate pING3784 encoding Gel::kappa, 
Fd. 

f. g3l;;Fflt Kapp* 

Plasmid pING37€8 (Example 15F) , which encodes the 
fusion protein Gel: :RMA: :Fd, kappa, was cut with Ndel and 
Nhel, and the DNA segment containing the majority of the 
he3 Fd gene, the he3 kappa gene and a portion of the 
tetracycline resistance gene of the vector was purified. 
Plasmid pING3350, which is described in section C above, 
was cut with Ndel and PstI, and the DNA fragment containing 
the 5* -end of the he3 Fd gene linked to the gelonin gene 
was purified. The DNA fragments from pING3350 and pING3763 
were ligated intq. the vector fragment from pING4633 
(Example 16D) that had been cut with Nhel and Pstl to 



WO 94/26910 

PCT/US94/05348 

-115- 

generate plNG3789. Plasmid pING3789 encodes the fusion 
protein Gel::Fd, kappa. 



10 



Example 19 

Alternative Ca theosin Cleavable Linker^ 

The segment of rabbit muscle aldolase chosen for 
the RMA linker described herein is known to contain peptide 
sequences susceptible to digestion vith cathepsins. other 
cathepsin-cleavable protein segments are effective targets 
for intracellular cleavage, and two particular amino acid 
sequences were included as cleavable linkers in additional 
immunofusions of the invention. These are the amino acid 
sequence KPAKFFRL (SEQ ID NO: 141 ("CCF") and KPAKFLRL (SEQ 
ID NO: 142) ("CCL") . Two oligonucleotides were synthesized 
that encode these peptide segments. Degeneracy was 
15 introduced at one nucleotide position in each synthetic 

primer to allow the appropriate amino acid to be encoded at 
the particular amino acid position in which CCF and CCL 
differ. The two oligonucleotides 5'- 

GGCCGCAAAGCCGGCTAAGTTCTT (A/C) CGTCTGAGT-3 ' (SEQ ID NO: 143) 
20 and 5 ' -ACTCAGACG (G/T) AAGAACTTAGCCGGCTTTGC-3 ' (SEQ ID NO: 

144). The oligonucleotide linkers were then used to 
assemble a family of fusion gene expression vectors 
encoding : Gel : : CCL : : kappa , Fd ; Gel : : CCF : : kappa , Fd ; 
Gel: :CCF: :V L V 8 ; and Gel: :CCL: : V H V L . 
25 The CCL and CCF linkers were also included 

in fusion vectors where the antigen-binding domain of the 
fusion protein was at the N- terminus of the fusion to 
generate expression vectors encoding immunofusions such as 
V L V H : :CCL: :Gel. 

30 Several of the fusion proteins with the CCL 

and CCF linkers were tested for cytotoxicity on the T cell 
lines HSB2 and PBMC and were comparable in activity to the 
fusion proteins containing the RMA linker. 
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Ex ample 2 0 

Expression And Purification Of Gelonin Immu nofusinns 
A. Expression Of Gelonin I^^ nof usions 

Each of the gelonin gene fusions whose 
5 construction is described in Example 15 was co-expressed 

with its pair H65 Fab gene in arabinose-induced E. coli 
strain E104. 

Expression products of the gene fusions were 
detected in the supernatant of induced cultures by ELI5A* 

10 Typically, a plate was coated with antibody recognizing 

gelonin. Culture supernatant was applied and bound Fab was 
detected with antibody recognizing human kappa coupled to 
horseradish peroxidase. H65 Fab fragment chemically 
conjugated to gelonin was used a standard. Alternative 

15 ELISA protocols involving coating a plate with antibody 

recognizing either the kappa or Fd or involving a detection 
step with anti-human Fd rather than anti-human kappa 
yielded similar results. Only properly assembled fusion 
protein containing gelonin, kappa and Fd was detected by 

20 this assay. Unassociated chains were not detected. 

The fusion protein produced from induced cultures 
containing expression vectors pING4406, 4407, 4408, and 
4410 in E. coli E104 accumulated at about 20-50 ng/ml. The 
fusion proteins expressed upon induction of pING3754, 3334, 

25 3758 and 3759 (but not pING3757) were expressed at much 

higher levels, at about 100 to 500 ng/ml. A fusion protein 
of about 70,000 Kd was detected in the concentrated E. coli 
culture supernatant by immunostaining of Western blots with 
either anti-human kappa or anti-gelonin antibodies. 

30 The Gelonin: :SLT: :Fd' (kappa) fusion protein from 

pING3754 (ATCC 69102) was purified from induced 10 L 
fermentation broth. The 10 L fermentation broth was 
concentrated and buffer exchanged into lOrnM phosphate 
buffer at pH 7.0, using an S10Y10 cartridge (Amicon) and a 

35 DC10 concentrator. The supernatant was purified by passing 
the concentrated supernatant through a DE52 column (20 x 5 
cm) equilibrated with 10 mM sodium phosphate buffer at pH 
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7.0. The flow-through was then further purified and 
concentrated by column chromatography on CM52 (5 x 10 cm) 
in 10 mM phosphate buffer. A 0 - 0.2 M linear gradient of 
NaCl was used to the elute the fusion protein, and 
5 fractions containing the fusion protein were pooled and 

loaded onto a Protein G column (lml). The fusion protein 
was eluted from protein G with 0.2 M sodium citrate , pH 5.5 
and then 0.2 M sodium acetate, pH 4.5, and finally, 0.2 M 
glycine, pH 2.5. The Gelonin: :RMA: :Fd' (kappa) and 

10 Gelonin: :RMA: : kappa (Fd') fusions proteins were purified 

from fermentation broths by similar methods except that the 
CM52 column step was eliminated, and the DE52 column was 
equilibrated with lOOmM sodium phosphate buffer at pH 7.0. 
The fusion proteins were not purified to homogeneity. 

15 Each of the three purified fusion proteins was 

then assayed for activity in the RLA assay and for 
cytotoxicity against the T-cell line HSB2. (T cells 
express the CD5 antigen which is recognized by H65 
antibody.) The RLA assay was performed as described in 

20 Example 4 and results of the assay are presented below in 

Table 12. 



Tabl« 13 

Fusion Protein IC50fp*O 

rGelonin 11 

25 Gelonin: :SLT: : Fd (kappa) 19 

Gelonin : : RMA : : Fd (kappa ) 2 8 

Gelonin: :RKA: : kappa (Fd) 10 



Two fusion proteins were tested in whole cell cytotoxicity 
assays performed as described in Example € (Table 13). As 
30 shown in Table 13, the fusion proteins were active. 

Gelonin: :SLT: :Fd( kappa) killed two T cell lines, H5B2 and 
CEM f with respective IC 50 s 2 -fold (HSB2) or 10-fold (CEM) 
higher than that of the gelonin chemically linked to H65. 
See Table 13 below for results wherein IC 30 values were 
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adjusted relative to the amount of fusion protein in each 
sample* 



Table 13 

IC 50 (pMT) 

Fusion Protein HSB2 Cells 

heSFab-Gel^c^) t 165 
Gelonin: :SLT: :Fd (kappa) 180 
Gelonin: :RMA: :Fd (kappa) 150 



CEM Cells 
173 
1007 
NT 



10 



These fusion protein showed similar activity on peripheral 
blood mononuclear cells (data not shown) . 



B. Purifica tion of Immunof usions 

(i) Immunof us ions Comprising cH65Fab' 
Immunof us ions comprising a cH€5Fab / fragment were 
purified from cell-free supernatants by passing the 

15 supernatant through a CM Spheradex (Sepacor) column (5cm x 

3cm) , equilibrated in 10 Mm Na phosphate at pH 7.0. 
Immunof us ion proteins bind to the column and are eluted 
with 10 mM Na phosphate, 200 mM NaCl, pH 7.0. The eluate 
was diluted two-fold with 20 Mm HEPES, 3 M ammonium 

20 sulfate, pH 7.6 and loaded onto a phenyl sepharose fast 

flow (Pharmacia) column (2.5 x3.5 cm) , equilibrated in 20 
mM HEPES, 1.2 M ammonium sulfate, pH 7.0. The column was 
next washed with 20 mM Hepes, 1.2 M ammonium sulfate, pH 
7.0 and eluted with 20 mM HEPES, 0.9 M ammonium sulfate, pH 

25 7.0. The phenyl sepharose eluate was concentrated to a 
volume of 2-4 ml in an Amicon stirred cell fitted with a 
YM10 membrane. The concentrated sample was loaded onto an 
S-200 (Pharmacia) column (3.2x 38 cm), equilibrated in 10 
mm Na phosphate, 150 mm NaCl, pH 7.0. The column was run 

30 in the same buffer and fractions were collected. Fractions 

containing the fusion protein of desired molecular weight 
were combined. For example, by selection of appropriate 
column fractions, both monovalent (gelonin-Fab' ) and 
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bivalent (gelonin 2 -F(ab / ) 2 forms encoded by pING3 758 were 
purified. 



(ii) Impinnn fusions Comprising he3Fab 
Immunofusions comprising he3Fab were purified as 
5 in the preceding section with the exception that the phenyl 

sepharose column was eluted with 20 mM HEPES, i.o M 
ammonium sulfate, pH 7.0. 



(iii) Immunofusions Comprising SCA 

Cell-free supernatant was passed through a CM 

10 spheradex column (5x3 cm), equilibrated with 10 mM Na 

phosphate, pH 7.0. Single-chain antibody binds to the 
column which is then washed with 10 mM Na phosphate, 4 5 mM 
NaCl, pH 7.0. The fusion protein was then eluted with 10 
mM Naphosphate, 200 mM NaCl, pH 7.0. The eluate was 

15 diluted two-fold with 20 mM HEPES, 3 M ammonium sulfate, pH 

7.0 and loaded onto a butyl sepharose Fast Flow (Pharmacia) 
column (2.5 x 4.1 cm) equilibrated in 20 mM HEPES, 1.5 M 
ammonium sulfate, pH 7.0. The column was then washed with 
20 mM HEPES , 1.0 M ammonium sulfate, pH 7.0 and eluted with 

20 20 mM HEPES pH 7.0. The butyl sepharose eluate was 

concentrated to a volume of 2-4 ml in an Amicon stirred 
cell fitted with a YM10 membrane. The concentrated sample 
was loaded onto an S-200 (Pharmacia) column (3.2 x 38 cm) 
equilibrated in 10 mM Na phosphate, 150 mM NaCl, pH 7.0. 

25 The column was then run in the same buffer and the 
fractions were collected. Some of the fractions were 
analyzed by S0S-PA6E to determine which fractions to pool 
together for the final product. 



30 Activity of Gelonin Immunofusions 

A concern. in constructing immunofusions 
comprising any RIP is that the targeting and enzymatic 
activities of the components of the fusion protein may be 
lost as a result of the fusion. For example, attachment of 
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an RIP to the amino terminus of an antibody may affect the 
antigen-binding (complementarity-determining regions) of 
the antibody and may also result in steric hinderance at 
the active site. Similarly, the activity of an RIP may be 
5 hindered by attachment of an antibody or antibody portion. 

For example, RIPs chemically conjugated to antibodies via 
a disulfide bridge are typically inactive in the absence of 
reducing agents* In order to assess the foregoing in 
immunofusions of the present invention, such proteins were 
10 subjected to assays to determine their enzymatic, binding, 

and cytotoxic activities. 



A. Reticulocyte Lvsate Assay 

The enzymatic activity of immunofusions 
comprising gelonin was assayed using the reticulocyte 

15 lysate assay (RLA) describe above. As noted in Example 4, 

the RLA assay measures the inhibition of protein synthesis 
in a cell-free system using endogenous globin mRNA from a 
rabbit red blood cell lysate. Decreased incorporation of 
tritiated leucine ( 3 H-Leu) was measured as a function of 

20 toxin concentration. Serial log dilutions of standard 

toxin (the 30 kD form of ricin A-chain, abbreviated as RTA 
30) , native gelonin, recombinant gelonin (rGelonin or rGel) 
and gelonin analogs were tested over a range of 1 pg/ml to 
1 pg/ml. Samples were tested in triplicate, prepared on 

25 ice, incubated for 30 minutes at 37 *C, and then counted on 

an Inotec Trace 9 6 cascade i oni z at ion counter . By 
comparison with an uninhibited sample, the picomolar 
concentration of toxin (pM) which corresponds to 50% 
inhibition of protein synthesis (IC 50 ) was calculated. 

30 Representative data for various immunotoxins of 

the invention are shown below in Table 14 . 
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Table 14 







inuauj iu uuxin 


bCJt NO . 






r^ei • 


:KriA: :auA \ Vg- v^) 


A£113 6 


12 




rGel l 


: kwa : : sga ( v^— v a ; 


AB1137 


18 


5 


rGel: 


:SLT: :SCA(V H -V L ) 


AB1133 


26 




rGel: 


:RMA: :SCA(V L -V B ) 


AB1124 


33 




rGel: 


:RMA: : K+Fd ' (cH65Fab') 


AB1122 


54 




rGel: 


:SLT: :K+Fd(he3Fab) 


AB1160 


40 




rGel: 


:RMA: :K+Fd (he3Fab) 


AB1149 


33 


10 


rGel: 


:KMA: :Fd+K(he3Fab) 


AB1163 


14 




rGel: 


:Fd'+K(cH65Fab') 


AB1123 


45 



Contrary to the expectations discussed above, 
gelonin immunofusions of the invention exhibit enzymatic 
activity which is comparable to the activities of native 
15 and recombinant gelonin shown in Example 4. This was true 

for fusions made with either the reducible (SLT) or non- 
reducible (RMA) linkers • 

B. Binding Activity of Immunofusions 

Several immunofusions according to the present 

20 invention were assayed for their ability to compete with 

labelled antibody for binding to CD5-positive cells. The 
Kd of the immunofusions was estimated by three different 
means as shown in Table 15. The first Kd estimation (Kd x in 
Table 15) was obtained by competition with fluorescein- 

25 labelled H65 IgG for binding to MOLT-4X cells (ATCC CRL 

1582) according to the procedure reported in Knebel, et 
ai., cytometry Suppl., 2: 68 (1987), incorporated by 
reference herein. 

The second Kd measurement (Kd 2 in TablelS) was 

30 obtained by Scatchard analysis of competition of the 

immunofusion with 125 I-cH65 IgG for binding on M0LT-4M cells 
as follows. A 20 Mg aliquot of chimeric H65 IgG (cH65 IgG) 
was iodinated by exposure to 100 Ml lactoperoxidase-glucose 
oxidase immobilized beads (Enzymobeads, BioRad) , 100 Ml of 

35 PBS, l.o mCi I 123 (Amersham, IMS30) , 50 ^1 of 55 mM 
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b-D-glucose for 4 5 minutes at 2 3 °C. The reaction was 
quenched by the addition of 20 jxl of 105 mM sodium 
metabisulf ite and 120 mM potassium iodine followed by 
centrifugation for 1 minute to pellet the beads- 125 I-cH65 
5 IgG was purified by gel filtration using a 7 ml column of 

sephadex G25, eluted with PBS (137 mM NaCl, 1.47 mM KH 2 P0 4 , 
8.1 mM Na 2 HPO w 2.68 mM KC1 at pH 7.2-7.4) plus 0.1% BSA. 
1Z5 I-cH65 jg G recovery and specific activity were determined 
by TCA precipitation. 

10 Competitive binding was performed as follows: 

100 jil of Molt-4M cells were washed two times in ice-cold 
DHB binding buffer (Dubellco's modified Eagle's medium 
(Gibco, 320-1965PJ) , 1.0% BSA and 10 mM Hepes at pH 7.2 
-7.4). Cells were resuspended in the same buffer, plated 

15 into 96 v-bottomed wells (Costar) at 3 x 10 5 cells per well 

and pelleted at 4 0 C by centrifugation for 5 min at 1,000 
rpm using a Beckman JS 4.2 rotor; 50 Ml of 2X-concentrated 
0.1 nM ^I-cHeS IgG in DHB was then added to each well and 
competed with 50 jil of 2X - concentrated cH65 IgG in DHB at 

20 final protein concentrations from 100 nM to 0.0017 nM. The 

concentrations of assayed proteins were determined by 
measuring absorbance (A 2 « 0 and using an extinction 
coefficient of 1.0 for fusion proteins, 1.3 for Fab, and 
1.22 for Fab conjugated to gelonin. Also, protein 

25 concentrations were determined by BCA assay (Pierce 

Chemical) with bovine serum albumin as the standard. 
Binding was allowed to proceed at 4°C for 5 hrs and was 
terminated by washing cells three times with 200 m! of DHB 
binding buffer by centrifugation for 5 min. at 1,000 rpm. 

30 All buffers and operations were at 4°C. Radioactivity was 
determined by solubilizing cells in 100 jil of 1.0 M NaOH 
and counting in a Cobra II auto gamma counter (Packard) . 
Data from binding experiments were analyzed by the weighted 
nonlinear least squares curve fitting program, MacLigand, 

35 a Macintosh version of the computer program "Ligand" from 
Munson, Analyt. Biochem., 107:220 (1980), incorporated by 
reference herein* 
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Finally, the Kd (Kd 3 in the Table) was estimated 
by examination of the ED 50 values obtained from separate 
competition binding assays, performed as described in the 
previous paragraph. All three measurements are shown in 
Table 15 below: 



Table is 



MoIscuIp TVnp 




Kd ? 


Kd, 




1 • b 


ND 


ND 


CH65 IgG 


ND 


3.0 


2.5 


cH65Fab' 


4.0 


14.0 


ND 


cH65Fab'-rGel A30(C44) 


3.5 


13.0 


ND 


rGel: :RMA: : K+Fd ' (cH65Fab') 


16.0 


ND 


100 


he3Fab 


1.20 


2.60 


ND 


he3Fab-rGel A50<C44) 


1.10 


2.70 


ND 


rGel: :RMA: :K+Fd' (he3Fab) 


2.60 


ND 


5.0 


rGel: :SLT: :K+Fd(he3Fab) 


ND 


ND 


30 


SCA(V L -V a ) 


2.20 


ND 


30 


rGel: :RMA: :SCA(V H -VJ 


3.50 


ND 


20 


rGel: :RMA: :SCA(V L -V H ) 


4.70 


ND 


30 


SCA(V L -V H ) 


ND 


ND 


20 


rGel : :RMA: :SCA(V L -V H ) 


2.30 


ND 


ND 


ND = not determined 









The results presented in Table 15 suggest that 
Fab and SCA antibody forms may retain substantial binding 
25 activity even when fused to an RIP. 



c. ccaparatiyg cytotoxicity Assays 

Fusion proteins and immunoconjugates according to 
the present invention were used in a comparative cytoxicity 
assay. Two types of assays vera conducted, one targeting 
30 T cell line HSB2 , and the other targeting lect in-activated 
peripheral blood mononuclear cells (PBMC) according to 
procedures in Example 6. The results of the assays are 
presented below in Tables 16a, 16b and 16c. 
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The results presented in Tables 16a, 16b and 16c 
demonstrate that gelonin immunof usions may vary in their 
activity, in general, immunof usions of the invention which 
have IC 50 median or mean values of less than 2000 pM Toxin 
5 display strong activity; whereas those with IC 50 values 
equal to or less than 500 pM Toxin are considered highly 
active. In sum, the results in Tables 16a, 16b and 16c 
demonstrate that the optimum fusion protein for killing a 
particular cell line may vary depending upon the targeted 
10 cell. 

KWaple 23 

Preparation Of BRIP 

BRIP possesses characteristics which make it an 
attractive candidate for a component of immunotoxins . BRIP 

15 is a naturally unglycosylated protein that may have reduced 

uptake in the liver and enhanced circulatory residence time 
in vivo. Additionally, BRIP is less toxic and less 
immunogenic in animals than the A-chain of ricin. cloning 
of the BRIP gene and expression of recombinant BRIP in an 

20 E. coli expression system obviates the need to purify 
native BRIP directly from barley, and enables the 
development of analogs of BRIP which may be conjugated with 
an available cysteine residue for conjugation to 
antibodies . 

25 A. Purification Of BRIP And Generation 
Of Polyclonal Antibodies To BRIP 

Native BRIP was purified from pearled barley 

flour. Four kilograms of flour was extracted with 16 

liters of extraction buffer (10 mM NaP04, 25 mM NaCl, pH 

3 0 7.2) for 20 hours at 4"C. The sediment was removed by 
centrifugation, and 200 ml of packed S-Sepharose 
(Pharmacia, Piscataway, New Jersey) was added to absorb 
BRIP. After mixing for 20 hours at 4'C, the resin was 
allowed to settle out, rinsed several times with extraction 

35 buffer and then packed into a 2.6 x 40 cm column. once 
packed, the column was washed with extraction buffer (150 
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ml/h) until the absorbance of the effluent approached zero. 
BRIP was then eluted with a linear gradient of 0.025 to 0.3 
M NaCl in extraction buffer and 5 ml fractions were 
collected. BRIP-containing peaks (identified by Western 
5 analysis of column fractions) were pooled, concentrated to 
about 20 ml, and then chromatographed on a 2.6 x 100 cm 
Sephacryl S-200HR (Pharmacia) column equilibrated in 10 mM 
NaP0«, 125 mM NaCI, pH 7.4 (10 ml/hr) . BRIP-containing 
peaks were pooled again, concentrated, and stored at -70*C. 

10 T * e resulting purified BRIP protein had a 

molecular weight of about 30,000 Daltons, based upon the 
mobility of Coomassie-stained protein bands following SDS- 
PAGE. The amino acid composition was consistent with that 
published by Asano et ml., Carls berg Res . Comm., 45:619-626 

15 (1984). 

Rabbits were immunized with purified BRIP to 
generate polyclonal antisera. 

B. Cloning Of The BRTP Sana 

A cDNA expression library prepared from 
20 germinating barley seeds in the phage X expression vector 
XZAPII was purchased from Stratagene, La Jolla, CA. 
Approximately 700,000 phage plagues were screened with 
anti-BRlP polyclonal antisera and 6 immunoreactive plaques 
were identified. One plaque was chosen, and the cDNA 
25 contained therein was excised from XZAPII with EcoRl and 
subcloned into pUC18 generating the vector pBSl. The cONA 
insert was sequenced with Sequenase (United States 
Biochemical, Cleveland, Ohio) . The DNA sequence of the 
native BRIP gene is set out in SEQ ID NO: 12. To confirm 
30 that cDNA encoded the native BRIP gene, the cDNA was 
expressed in the E. coli plasmid pKX233-2 (Pharmacia) . 
BRIP protein was detected in IPTG-induced cells transformed 
with the plasmid by Western analysis with above-described 
rabbit ant i -BRIP antisera. 
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C. Construction Of An £. coli Expression 
Vector containing The BRIP Gene 

Barley cDNA containing the BRIP gene was linked 

to a pelB leader sequence and placed under control of an 

5 araB promoter in a bacterial secretion vector. 

An intermediate vector containing the BRIP gene 

linked to the pelB leader sequence was generated. Plasmid 

pBSl was cut with Ncol, treated with Mung Bean Nuclease, 

cut with BamHI and the 760 bp fragment corresponding to 

10 amino acids 1-256 of BRIP was purified from an agarose gel. 

Concurrently, a unique Xhol site was introduced downstream 
of the 3' -end of the BRIP gene in pBSl by PCR amplification 
with a pUC18 vector primer (identical to the Reverse* 
primer sold by NEB or BRL but synthesized on a Cyclone 

15 Model 8400 DNA synthesizer) and the specific primer BRIP 

3 ' Xho . The sequence of each of the primers is set out 
below. 

Reverse (SEQ ID NO: 45) 
5" AACAGCTATGACCATG 3* 

20 BRIP 3'Xho (SEQ ID NO: 46) 

5 f TGAACTCGAGGAAAACTACCTATTTCCCAC 3 • 
Primer BRIP 3 9 Xho includes a portion corresponding to the 
last 8 bp of the BRIP gene, the termination codon and 
several base pairs downstream of the BRIP gene, and an 

25 additional portion that introduces a Xhol site in the 

resulting PCR fragment. The PCR reaction product was 
digested with BamHI and Xhol, and an 87 bp fragment 
containing the 3' -end of the BRIP gene was purified on a 5% 
acrylamide gel* The 760 and 87 bp purified BRIP fragments 

30 were ligated in the vector pING1500 adjacent to the pelB 
leader sequence. pING1500 had previously been cut with 
SstI, treated with T4 polymerase, cut with Xhol, and 
purified. The DNA sequence at the junction of the pelB 
leader and the 5* -end of the BRIP gene was verified by DNA 

35 sequence analysis. This vector was denoted pING3321-l. 

The final expression vector was assembled by 
placing the BRIP gene under the control of the inducible 
araB promoter. Plasmid pING3321-l was cut with PstI and 
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Xhol, and the BRIP gene linked to the pelB leader was 
purified from an agarose gel. The expression vector 
PING3 217, containing the araB promoter, was cut with PstI 
and Xhol and ligated to the BRIP gene. The expression 
vector was denoted pING3322. 

Arabinose induction of s. coli cells containing 
the plasmid pING3322 in a fermenter resulted in the 
production of about 100 mg per liter of recombinant BRIP. 
E. coli-produced BRIP displays properties identical to BRIP 
purified directly from barley seeds. 

D. Construction Of BRIP Analogs 
With A Free Cysteine Residue 

The BRIP protein contains no cysteine residues, 
and therefore contains no residues directly available which 
may form a disulfide linkage to antibodies or other 
proteins. Analogs of recombinant BRIP were generated which 
contain a free cysteine residue near the C-terminus of the 
protein. Three residues of the BRIP protein were targets 
for amino acid substitutions. Comparison of the amino acid 
sequence of BRIP to the known tertiary structure of the 
ricin A-chain (see FIG. 2) suggested that the three 
positions would be available near the surface of the 
molecule. The three BRIP analogs include cysteines 
substituted in place of serine 277 , alanine 270 , and leucine 256 
of the native protein, and were designated BRIP^ (SEQ ID 
NO: 127), BRIPc2 70 (SEQ ID NO: 128) and BRIP^ (SEQ ID NO: 
129) , respectively. 

(1) A plasmid vector capable of expressing the 
BRIP C277 analog was constructed by replacing the 3 f -end of 
the BRIP gene with a DNA segment conferring the amino acid 
change. The EcoKI fragment containing the BRIP gene from 
pBSl was subcloned into M13mpl8, and single-stranded DNA 
(anti-sense strand) was amplified by PCR with primers 0BM2 
(corresponding nucleotides -11 to +8 of the BRIP gene) and 
OMB4 (corresponding to amino acids 264-280 of BRIP and the 
termination codon of BRIP, and incorporating the 
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substitution of a cysteine codon for the native codon for 
serine 277 of native BRIP) . The sequences of primers OBM2 and 
0MB4, wherein the underlined nucleotides encode the 
substituted cysteine, are set out below. 

OBM2 (SEQ ID NO: 47) 
5 1 GCATTACATCCATGGCGGC 3 1 
OMB4 (SEQ ID NO: 48) 

5 1 GATATCTCGAGTTAACTATTTCCCACCACACG 
CATGGAACAGCTCCAGCGCCTTGGCCACCGTC 3 1 
A fragment containing a BRIP gene in which the codon for 
the amino acid at position 277 was changed to a cysteine 
codon was amplified. The fragment was cloned into the SmaX 
site of pUC19 (BRL) and the plasmid generated was denoted 
pMB22. pMB22 was digested with ScoRI and an EcoRI-XhoI 
15 linker (Clonetech, Palo Alto, CA) was ligated into the 

vector. Subsequent digestion with Xhol and religation 
generated vector pINGMB2X. A BamHI to Xhol fragment 
encoding the 3 '-end of BRIP with the altered amino acid was 
excised from pMB2X and the fragment was purified on a 5% 
20 acrylamide gel. This fragment along with an ScoRI to BamHI 

fragment containing the pelB leader sequence and sequences 
encoding the first 256 amino acids of BRIP were substituted 
in a three piece ligation into pING3322 cut with EcoRl and 
Xhol. The resulting vector containing the BRIP^ analog 
25 was designated pING3803 (ATCC Accession No. 68722). 

(2) A BRIP analog with a free cysteine at 
position 256 was constructed using PCR to introduce the 
amino acid substitution. A portion of the expression 
plasmid pING3322 was amplified with primers BRIP-256 and 
30 2FZKDXXX-2. The sequence of each primer is set out below. 

BRIP-256 (SEQ ID NO: 49) 
5« ISICTGTT CGTGG AGGTG C CG 3 • 
HJ/raiII-2 (SEQ ID NO: 44) 
5 f CGTTAGCAATTTAACTGTGAT 3» 
35 Nucleotides 4-21 of primer BRIP-256 encode amino acids 256- 

262 of BRIP while the underlined nucleotides specify the 
cysteine to be substituted for the leucine at the 
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corresponding position of the native BRIP protein. Primer 
HINDI II- 2 corresponds to a portion of the plasmid. The PCR 
product, which encodes the carboxyl terminal portion of the 
BRIP analog, was treated with T4 polymerase, cut with Xhol, 
5 and the resulting fragment was purified on a 5% acrylamide 

gel. Concurrently, plasmid pING3322 was cut with BamHI, 
treated with T4 polymerase, cut with EcoRI, and the 
fragment containing the pelB leader sequence and sequences 
encoding the first 256 amino acids of BRIP was purified* 

10 The two fragments were then assembled back into pING3322 to 

generate the gene encoding the analog BRIP C2J6 . This plasmid 
is denoted pING3801. 

(3) A BRIP analog with a cysteine at position 
270 was also generated using PCR, A portion of the 

15 expression plasmid pING3322 was amplified with primers 

BRIP-270 and the HJNDIII-2 primer (SEQ ID NO: 44). The 
sequence of primer BRIP-270 is set out below. 

BRIP-270 (SEQ ID NO: 50) 

5' CCAAGTGTCTGGAGCTGTTCCATGCGA 3' 

20 Primer BRIP-270 corresponds to amino acids 268-276 of BRIP 

with the exception of residue 270. The codon of the primer 
corresponding to position 270 specifies a cysteine instead 
of the alanine present in the corresponding position in 
native BRIP. The PCR product was treated with T4 

25 polymerase, cut with Xhol, and the 51 bp fragment, which 

encodes the carboxyl terminal portion of the analog, was 
purified on a 5% acrylamide gel. The fragment 
(corresponding to amino acids 268-276 of BRIP^o) was cloned 
in a three piece ligation along with the internal 151 bp 

30 BRIP restriction fragment from Sstll to MscI {corresponding 
to BRIP amino acids 217-267) from plasmid pING3322, and 
restriction fragment from Sstll to Xhol from pING3322 
containing the remainder of the BRIP gene. The plasmid 
generated contains the gene encoding the BRIP C2 7o analog and 

35 is designated pING3802. 
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E. Purification Of Recombinant 
BRI? And The BRI? Analogs 

Recombinant BRIP (rBRIP) and the BRIP analogs 

with free cysteine residues were purified essentially as 

5 described for native BRIP except they were prepared from 

concentrated fermentation broths. For rBRIP , concentrated 

broth from a 10 liter fermentation batch was exchanged into 

10 mM Tris, 20 mM NaCl pH 7.5, loaded onto a Sephacryl S- 

200 column, and eluted with a 20 to 500 mM NaCl linear 

10 gradient. Pooled rBRIP was further purified on a Blue 

Toyopearl® column (TosoHaas) loaded in 20 mM NaCl and 
eluted in a 20 to 500 mM NaCl gradient in 10mM Tris, pH 
7.5. For BRIP analogs, concentrated fermentation broths 
were loaded onto a CM52 column (Whatman) in 10 mM phosphate 

15 buffer, pH 7.5, and eluted with a 0 to 0.3M NaCl linear 

gradient. Further purification was by chromatography on a 
Blue Toyopearl® column. 



F. Reticulocyte Lvsate Assay 

The ability of the rBRIP and the BRIP analogs to 

20 inhibit protein synthesis in vitro was tested by 

reticulocyte lysate assay as described in Example l. 
Serial log dilutions of standard toxin (RTA 30) , native 
BRIP, rBRIP and BRIP analogs were tested over a range of 1 
Mg/al to 1 pg/ml. By comparison with an uninhibited 

25 sample, the picomolar concentration of toxin (pM) which 
corresponds to 50* inhibition of protein synthesis (IC 30 ) 
was calculated. The results of the assays are presented 
below in Table 17. 
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Table 17 



Toxin K w (r>K) 

RTA 30 3.1 

Native BMP 15 

5 rBRIP 18 

BRIP C256 23 

. BRIP C2 7o 20 

BRIP C277 24 



The RLA results indicate that the BRIP analogs 
10 exhibit ribosome-inactivating activity comparable to that 

of the recombinant and native BRIP toxin. All the analogs 
retained the natural ability of native BRIP to inhibit 
protein synthesis, suggesting that amino acid substitution 
at these positions does not affect protein folding and 
15 activity. 

constructio n of BRIP Immunoconiuaates 

Immunoconjugates of native BRIP (SEQ ID NO: 3) 
with 4A2 (described in Morishima et al., J*. Immunol., 

20 129:1091 (1982) and H65 antibody (obtained from hybridoma 

ATCC HB9286) which recognize the T-cell determinants CD7 
and CDS , respectively, were constructed. Immunoconjugates 
of ricin A-chains (RTAs) with 4A2 and H65 antibody were 
constructed as controls. The H65 antibody and ricin A- 

25 chains as well as the RTA immunoconjugates were prepared 

and purified according to methods described in U.S. Patent 
Application Serial No. 07/306,433 supra and in 
International Publication No. WO 89/06968. 

To prepare immunoconjugates of native BRIP, both 

30 the antibody (4A2 or H65) and native BRIP were chemically 

modified with the hindered linker 5-methyl-2-iminothiolane 
(M2IT) at lysine residues to introduce a reactive 
sulfhydryl group as described in Goff et el., Bioconjugate 
Chem., 2:381-386 (1990). BRIP (3 mg/ml) was first 

35 incubated with 0.5 mM M2IT and 1 mM DTNB in 25 mM 
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triethanolamine, 150 mM NaCl, pH 8.0, for 3 hours at 25 # c. 
The derivitized BRIP- (M2IT) -S-S-TNB was then desalted on a 
column of Sephadex GF-05LS and the number of thiol groups 
introduced was quantitated by the addition of o.i mM DTT. 
5 On average, each BRIP molecule contained 0.7 SH/mol. 

4A2 or H65 antibody (4 mg/ml) in triethanolamine 
buffer was similarly incubated with M2IT {0.3 mM) and DTNB 
(1 mM) for 3 hours at 25*C. Antibody- (M2IT) -S-S-TNB was 
then desalted and the TNB: antibody ratio was determined. 
10 To prepare the conjugate, the BRIP-(M2IT) -S-S-TNB was first 
reduced to BRIP- (M2IT) -SH by treatment with 0.5 mM DTT for 
1 hour at 25 # C, desalted by gel filtration of Sephadex* GF- 
05LS to remove the reducing agent, and then mixed with 
antibody- (M2 IT) -S-S-TNB . 
15 Following a 3 hour incubation at 25 'C, and an 

additional 18 hours at 4*C, the conjugate was purified by 
sequential chromatography on AcA44 (IBF) and Blue 
Toyopearl*. Samples of the final product were run on 5% 
non-reducing SDS PAGE, Coomassie stained, and scanned with 
20 a Shimadzu laser densitometer to quantitate the number of 

toxins per antibody. 

The BRIP analogs containing a free cysteine were 
also conjugated to 4A2 and H65 antibodies. The analogs 
were treated with 50 mM DTT either for 2 hours at 25 *C or 
25 for 18 hours at 4*C to expose the reactive sulfhydryl group 

of the cysteine and desalted. The presence of a free 
sulfhydryl was verified by reaction with DTNB [Ellman et 
al., Arch. BlochBm. Biophys, 82:70-77 (1959)]. 4A2 or H65 
antibody derivatized as described above with M2IT was 
30 incubated with the reduced BRIP analogs at a ratio of 1:5 

at room temperature for 3 hours and then overnight at 4*C. 
Immunoconjugates HSS-BRIP^, 4A2-BRIP C236 , HSS-BRIP^ were 
prepared in 25 mM triethanolamine, 150 mM NaCl pH 8, while 
immunocon j ugates H6 5 -BRIP^o , 4 A2 -BRIPc2 70 and 4 A2 -BRIP C277 
35 were prepared in 0.1 M sodium phosphate, 150 mM NaCl pH 
7.5. Following conjugation, 10 mM mercaptoethyl amine was 
added for 15 minutes at 25 'C to quenched any unreacted m2IT 
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linkers on the antibody. The quenched reaction solution 
was promptly loaded onto a gel filtration column (AcA44) to 
remove unconjugated ribosome-inactivating protein. 
Purification was completed using soft gel affinity 
chromatography on Blue Toyopearl* resin using a method 
similar to Knowles et al., Analyt. Blochem., 160 i 440 
(1987). Samples of the fin^l product were run on 5% non- 
reduced SDS PAGE, Coomassie stained, and scanned with a 
Shimadzu laser densitometer to quantitate the number of 
toxins per antibody. The conjugation efficiency was 
substantially greater for BRIP C277 (78%) than for either of 
the other two analogs, BRIP C270 and BRIP C256 (each of these was 
about 10%) . Additionally, the BRIP C277 product was a 
polycon jugate, i.e., several BRIP molecules conjugated to 
a single antibody, in contrast to the BRIP C270 and BRIP C236 
products which were monoconjugates. 



example 24 

Properties Of BRIP Imnunoconiuqates 
A. Whole Cell Kill Assay 

Immunoconjugates of native BRIP and of the BRIP 
analogs were tested for the ability to inhibit protein 
synthesis in HSB2 cells by the whole cell kill assay 
described in Example 1. Standard immunoconjugates H65-RTA 
(H65 derivatized with SPDP linked to RTA) and 4MRTA (4A2 
antibody derivatized with M2IT linked to RTA) and BRIP 
immunocon jugate samples were diluted with RPMI without 
leucine at half -log concentrations ranging from 2000 to 
0.632 ng/ml. All dilutions were added in triplicate to 
microtiter plates containing 1 x 10 5 HSB2 cells. HSB2 
plates were incubated for 20 hours at 37 *C and then pulsed 
with 3 H-Leu for 4 hours before harvesting. Samples were 
counted on the Inotec Trace 96 cascade ionization counter. 
By comparison with an untreated sample, the picomolar toxin 
concentration (pM T) of immunocon jugate which resulted in 
a 50% inhibition of protein synthesis (IC 50 ) was calculated. 
The assay results are presented below in Table 18. 
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Conjugate 

4A2-BRIP 

4A2-BRIP C270 

4A2-BRIP C277 

4A2-BRIP C25S 

H65-BRIP 4 

H65-BRIP C277 
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Table 18 

122.45 
46.3 
57.5 
1116 
>5000 
1176 



20 



The BRIP analog conjugates were less potent than 
10 the ricin conjugate control (data not shown) . The 

immunotoxins containing antibody 4A2 and either the BRIP C270 
or the BRIP C277 analog exhibited comparable to increased 
specific cytotoxicity toward target cells as compared to 
immunotoxin containing native BRIP. While 4A2-BRIP C256 is 
15 less active than 4A2-BRIP, Ate-BRIP^ and 4A2-BRIP C277 are 

between 3 and 4 times more active. Similarly, the 
immunocon jugate of H65 to BRIP^y shows greater toxicity 
toward target cells than the immunocon jugate of H65 to 
native BRIP. Thus, linkage of antibody to BRIP derivatives 
which have an available cysteine residue in an appropriate 
location results in immunotoxins with enhanced specific 
toxicity toward target cells relative to conjugates with 
native BRIP. 



B. Disulfide Bond Stability Assay 

25 Immunocon jugates prepared with native BRIP and 

the BRIP analogs were examined by the disulfide bond 
stability assay described in Example 1. Briefly, 
conjugates were incubated with increasing concentrations of 
glutathione for 1 hour at 37 *C and, after terminating the 

30 reaction with iodoacetamide, the amount of RIP released was 

quantitated by size-exclusion HPLC on a TosoHaas TSK- 
G2000SW column. 

By comparisons with the amount of RIP released by 
high concentrations of 2-mercaptoethanol (to determine 100% 

35 release) , the concentration of glutathione required to 
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release 50% of the RIP (the RC 50 ) was calculated. As shown 
below in Table 19, the conjugates prepared with BRIP C270 or 
BRIPci^ were significantly more stable than either the rta 
conjugates or those prepared with native BRIP. 



Table 19 



Conjugate RC 5G _QnJH 

H65-RTA 7.0 

H65-BRIP 2.8 

H65-BRIPC277 196 . 0 

10 4A2-RTA 4.4 

4A2-BRIP 3.3 

42-BRIP C270 53 .0 

4A2-BRIP C277 187.0 



These unexpected results suggest that conjugates prepared 
with Type I RIP analogs according to the present invention 
may have enhanced stability and efficacy in vivo. 



Example 25 

Preparation of Momordin and Analogs Thereof 

Plants of the genus Momordica produce a number of 
20 related proteins known as momordins or momorcharins which 

are Type I RIPs. The gene encoding momordin II was cloned 
from Momordica balsamina seeds. 



A. Pygparqtipn Qg telswin* RJ*A 

Total RNA was prepared from 4 g of tf. balsamina 

25 seeds as described in Ausubel et al., supra. PolyA 
containing RNA was prepared from 1 mg of total RNA by 
chromatography on oligo-(dT) -cellulose. 40 mg of oligo- 
(dT) -cellulose Type 7 (Pharmacia) was added to 0.1 N NaOH 
and poured into a disposable column (Biorad) • The column 

30 was washed with water until the eluate was pH 5.5, and then 
was washed with IX loading buffer (50 mM Na Citrate, 0.5M 
NaCl, 1 mM EDTA, 0.1% SDS, pH 7.0) until the eluate was pH 
7.0. 1 mg of total RKA was suspended in 300 Ml of water, 
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heated to 65 'c for 5 minutes, and 3 00 M l of 2X loading 
buffer was added (100 mM Na Citrate, im NaCl, 2 inM edta, 
and 0.2% SDS). .The RNA was loaded onto the column, and the 
-flow through was reheated to 65 'c, cooled to room 
5 temperature, and reloaded onto the column. Column-bound 

mRNA was washed 5 times with 0.5 ml of ix loading buffer, 
and pwo times with 0.5 ml of 0.05M Nacitrate, 0.1 M NaCl, 
1 mM EDTA, 0.1% SDS. PolyA- containing RNA was eluted two 
times from the column with 0.5 ml of 25 mM Nacitrate, l mM 
10 EDTA, and 0.05% SDS. 



B. Library Preparation 

A cDNA library from the polyA-containing m. 
balsamlna RNA was prepared in a bacterial expression 
plasmid with the Superscript Plasmid System (BRL, 
Gaithersburg, Maryland) . The cDNA was synthesized from 2 
Mg of poly A-containing RNA, size fractionated, digested 
with NotI, and ligated into the expression vector pSPORT as 
recommended by the manufacturer of the vector, BRL. 



C. Cloning Of The Momordin II Gene 

20 A DNA fragment encoding the first 27 amino acids 

of momordin II was amplified from M. balsamlna cDNA by PGR. 

First strand cDNA was prepared from 100 ng of polyA 
containing RNA with an RNA-PCR Kit (Perkin Elmer Cetus) . 
Two partially degenerate primers were synthesized based on 

25 the amino acid sequence of the first 27 amino acids of 

momordin II described in Li et aJ., Sxperientla, 36:524-527 
(1980) . Because the amino acid sequence of amino acids 1- 
27 of momordin II is 52% homologous to amino acids 1-17 of 
momordin I [Ho et al., SBA, 1083:311-314 (1991)], some 

30 codon assignments in the degenerate primers were based on 
homology to the corresponding amino acid as well as codon 
preference in the momordin I gene. The sequences of 
primers momo-3 and momo-4 are set out: below using IUPAC 
nucleotide symbols. 
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momo-3 (SEQ ID NO: 51) 
5* GATGTTAAYTTYGAYTTGTCNACDGCTAC 3' 
momo-4 (SEQ ID NO: 52) 
5 * ATTGGNAGDGTAGCCCTRAARTCYTCDAT 3 ' 
5 The resulting 81 bp PCR product was purified on a 5% 

acrylamide gel and cloned into the Sinai site of pUClS. 
Three candidate clones were sequenced, and one clone, 
pMOHO, was identified which encoded the N-terminal 27 
amino acids of momordin II. 
10 A hybridization probe was designed for screening 

of the momordin II cDNA library based on the sequence of 
the pMOHO momordin II DNA fragment- The sequence of the 
primer momo-5 is shown below* 

momo-5 (SEQ ID NO: 53) 
15 5» GCCAC TGCAAAAACCTACACAAAATTTA TTGA 3' 

Primer momo-5 corresponds to amino acids 9-18 of mature 
momordin II. The underlined nucleotides of the primer were 
expected to match the DNA sequence of the momordin II gene 
exactly. Since this sequence is highly A/T-rich and may 
20 hybridize to the momordin II gene weakly, the additional 

adjacent nucleotides were included in the primer* Bases 3 
and 30 (overlined) were in the "wobble" position (i.e., the 
third nucleotide in a codon) of amino acids 9 (alanine) and 
18 (isoleucine) , respectively, of momordin II and may not 
25 be identical to the nucleotide bases in the native gene. 

A 90,000 member cDNA library in pSPORT was 
screened with 32 P-kinased momo-5, and eight potential 
candidate clones were identified. One clone, pING3619, was 
sequenced and contains an open reading frame corresponding 
30 in pari: to the expected N-terminal 27 residues of Momordin 
II. The complete momordin gene contains 286 amino acids, 
the first 23 of which are a presumed leader signal (mature 
momordin II is 263 residues) . The DNA sequence of the 
momordin II gene is set out in SEQ ID NO: 13. 
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D. Construction Of An Expression 

Vector Containing The M omordin r.a T1f 

A bacterial expression vector for the momordin n 
gene was constructed . Two PCR primers were synthesized, 
5 one (momo-9) which primes from the +1 residue of the mature 

momordin II amino acid sequence, and one at the C-terminus 
(momo-10) of momordin II which introduces an xhol 
restriction site: 

momo-9 (SEQ ID NO: 54) 
0 5 ' GATGTTAACTTCGATTTGTCGA 3 • 

momo-10 (SEQ ID NO: 55) 

5* TCAACTCGAGGTACTCAATTCACAACAGATTCC 3* 
PING3619 was amplified with momo-9 and momo-io, and the 
product was treated with T4 polymerase, cut with Xhol, and 

5 purified on an agarose gel. This gene fragment was ligated 

along with the 131 bp pelB leader fragment from picioa 
which has been generated by SstI digestion, T4-polymerase 
treatment, and EcoRl digestion, into the araB expression 
vector cleaved with ScoRl and Xhol. The product of this 

► three piece ligation was sequenced to verify that the pelB 

junction and momordin II coding sequence were correct. 
Arabinose induction of cells containing the momordin II 
expression plasmid pING3621 results in production of 
momordin II in E. colx. 



E. Analogs Of Momordin TT 

Mormordin II has no natural cysteines available 
for conjugation to antibody. Analogs of momordin which 
have a free cysteine for conjugation to an antibody may be 
constructed. Positions likely to be appropriate for 
substitution of a cysteine residue may be identified from 
Figure 3 as positions near the ricin A-chain cysteine 23 , and 
as positions including the last 26 amino acids of momordin 
II that are accessible to solvent. For example, the 
arginine at position 242 of momordin II aligns with the 
ricin A-chain cysteine at position 259 and is a preferred 
target for substitution. Additional preferred substitution 
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positions for momordin II include the serine at position 
241 and the alanine at position 243. 

While the present invention has been described in 
terms of preferred embodiments, it is understood that 
variations and improvements will occur to those skilled in 
the art. Therefore, it is intended that the appended 
claims cover all such equivalent variations which come 
within the scope of the invention as claimed. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

( i ) ^ APPLICANT : Better, Marc D. 

Carroll, Stephen F. 
Studnicka, Gary M. 

(ii) TITLE OF INVENTION: Fusion Proteins and Polynucleotides Encoding 

Gelonin Sequences 

(iii) NUMBER OF SEQUENCES: 173 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: McAndrews , Held & Malloy, Ltd. 

(B) STREET: 500 W. Madison Street, 34 ch Floor 

(C) CITY: Chicago 
CD) STATE: Illinois 

(E) COUNTRY: USA 

(F) ZIP : 60661 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: not yet assigned 

(B) FILING DATE: ll-AUG-1998 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/646,360 
<B) FILING DATE: 13-MAY-1996 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US94/05348 

(B) FILING DATE: 12-MAY-1994 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/064,691 

(B) FILING DATE: 12 -MAY- 1993 

(vii) PRIOR* APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/988,430 

(B) FILING DATE: 09-DEC-1992 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/901,707 

(B) FILING DATE: 19-JUN-1992 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/787,567 

(B) FILING DATE: 04 -NOV- 1991 

(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: McNicholas, Janet M. 
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(B) REGISTRATION NUMBER: 32,918 

(C) REFERENCE / DOCKET NUMBER: 200-70. P4 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 312/707-8889 

(B) TELEFAX: 312/707-9155 

(C) TELEX: 650 388-1248 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 267 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

lie Phe Pro Lys Gin Tyr Pro lie lie Asn Phe Thr Thr Ala Gly Ala 
1 5 10 is 



Thr Val Gin Ser Tyr Thr Asn Phe He Arg Ala Val Arg Gly Arg Leu 
20 25 30 



Thr Thr Gly Ala Asp Val Arg His Glu He Pro Val Leu Pro Asn Arg 
35 40 45 



Val Gly Leu Pro He Asn Gin Arg Phe He Leu Val Glu Leu Ser Asn 
50 55 60 



His Ala Glu Leu Ser Val Thr Leu Ala Leu Asp Val Thr Asn Ala Tyr 
65 70 75 80 



Val Val Gly Tyr Arg Ala Gly Asn Ser Ala Tyr Phe Phe His Pro Asp 
85 90 95 



Asn Gin Glu Asp Ala Glu Ala He Thr His Leu Phe Thr Asp Val Gin 
100 105 110 



Asn Arg Tyr Thr Phe Ala Phe Gly Gly Asn Tyr Asp Arg Leu Glu Gin 
115 120 125 



Leu 



Ala Gly Asn Leu Arg Glu Asn He Glu Leu Gly Asn Gly Pro Leu 
130 135 140 



# 
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Glu Glu Ala He Ser Ala Leu Tyr Tyr Tyr Ser Thr Gly Gly Thr Gin 
145 150 155 160 



Leu Pro Thr Leu Ala Arg Ser Phe He He Cys He Gin Met He Ser 
165 170 175 



Glu Ala Ala Arg Phe Gin Tyr He Glu Gly Glu Met Arg Thr Arg He 
180 185 190 



Arg Tyr Aan Arg Arg Ser Ala Pro Asp Pro Ser Val He Thr Leu Glu 
195 200 205 



Asn Ser Trp Gly Arg Leu Ser Thr Ala He Gin Glu Ser Asn Gin Gly 
210 215 220 



Ala Phe Ala Ser Pro He Gin Leu Gin Arg Arg Asn Gly Ser Lys Phe 
225 230 235 240 



Ser Val Tyr Asp Val Ser He Leu He Pro He He Ala Leu Met Val 
245 250 255 



Tyr Arg Cys Ala Pro Pro Pro Ser Ser Gin Phe 
260 265 

(2) INFORMATION FOR SEQ ID NO:2: 

<i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 251 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Gly Leu .Asp Thr Val Ser Phe Ser Thr Lys Gly Ala Thr Tyr He Thr 
15 10 15 



Tyr Val Asn Phe Leu Asn Glu Leu Arg Val Lys Leu Lys Pro Glu Gly 
20 25 30 



Asn Ser His Gly He Pro Leu Leu Arg Lys Lys Cys Asp Asp Pro Gly 
35 40 45 



Lys Cys Phe Val Leu Val Ala Leu Ser Asn Asp Asn Gly Gin Leu Ala 
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5 ° 55 60 



Glu lie Ala He Asp Val Thr Ser Val Tyr Val Val Gly Tyr Gin Val 
65 70 75 80 



Arg Asn Arg Ser Tyr Phe Phe Lys Asp Ala Pro Asp Ala Ala Tyr Glu 
85 90 95 



Gly Leu Phe Lys Asn Thr Xle Lys Thr Arg Leu His Phe Gly Gly Ser 
100 105 no 



Tyr Pro Ser Leu Glu Gly Glu Lys Ala Tyr Arg Glu Thr Thr Asp Leu 
115 120 125 



Gly He Glu Pro Leu Arg He Gly He Lys Lys Leu Asp Glu Asn Ala 
130 135 140 



He Asp Asn Tyr Lys Pro Thr Glu He Ala Ser Ser Leu Leu Val Val 
145 150 155 160 



He Gin Met Val Ser Glu Ala Ala Arg Phe Thr Phe He Glu Asn Gin 
165 170 175 



He Arg Asn Asn Phe Gin Gin Arg He Arg Pro Ala Asn Asn Thr He 
180 185 190 



Ser Leu Glu Asn Lya Trp Gly Lys Leu Ser Phe Gin He Arg Thr Ser 
195 200 205 



Gly Ala Asn Gly Met Phe Ser Glu Ala Val Glu Leu Glu Arg Ala Asn 
210 215 220 



Gly Lys Lys Tyr Tyr Val Thr Ala Val Asp Gin Val Lys Pro Lys He 
225 230 235 240 



Ala Leu Leu Lys Phe Val Asp Lys Asp Pro Lys 
245 250 

(2) INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 280 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Ala Ala Lys Met Ala Lys Asn Val Asp Lys Pro Leu Phe Thr Ala Thr 
15 10 15 



Phe Asn Val Gin Ala Ser Ser Ala Asp Tyr Ala Thr Phe lie Ala Gly 
20 25 30 



lie Arg Asn Lys Leu Arg Asn Pro Ala His Phe Ser His Asn Arg Pro 
35 40 45 



Val Leu Pro Pro Val Glu Pro Asn Val Pro Pro Ser Arg Trp Phe His 
50 55 50 



Val Val Leu Lys Ala Ser Pro Thr Ser Ala Gly Leu Thr Leu Ala lie 
65 70 75 80 



Arg Ala Asp Asn lie Tyr Leu Glu Gly Phe Lys Ser Ser Asp Gly Thr 
85 90 95 



Trp Trp Glu Leu Thr Pro Gly Leu lie Pro Gly Ala Thr Tyr Val Gly 
100 105 110 



Phe Gly Gly Thr Tyr Arg Asp Leu Leu Gly Asp Thr Asp Lys Leu Thr 
115 120 125 



Asn Val Ala Leu Gly Arg Gin Gin Leu Ala Asp Ala Val Thr Ala Leu 
130 135 140 



His Gly Arg Thr Lys Ala Asp Lys Ala Ser Gly Pro Lys Gin Gin Gin 
145 150 155 160 



Ala Arg Glu Ala Val Thr Thr Leu Val Leu Met Val Asn Glu Ala Thr 
165 170 175 



Arg Phe Gin Thr Val Ser Gly Phe Val Ala Gly Leu Leu His Pro Lys 
180 185 190 



# 
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Ala Val Glu Lys Lys Ser Gly Lys lie Gly Asn Glu Met Lys Ala Gin 
195 200 205 



Val Asn Gly Trp Gin Asp Leu Ser Ala Ala Leu Leu Lys Thr Asp Val 
210 215 220 



Lys Pro Pro Pro Gly Lys Ser Pro Ala Lys Phe Ala Pro lie Glu Lys 
225 230 235 240 



Met Gly Val Arg Thr Ala Glu Gin Ala Ala Asn Thr Leu Gly He Leu 
245 250 255 



Leu Phe Val Glu Val Pro Gly Gly Leu Thr Val Ala Lys Ala Leu Glu 
260 265 270 



Leu Phe His Ala Ser Gly Gly Lys 
275 280 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 263 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Asp Val Asn Phe Asp Leu Ser Thr Ala Thr Ala Lys Thr Tyr Thr Lys 
15 10 15 



Phe He Glu Asp Phe Arg Ala Thr Leu Pro Phe Ser His Lys Val Tyr 
20 25 30 



Asp He Pro Leu Leu Tyr Ser Thr He Ser Asp Ser Arg Arg Phe He 
35 40 45 



Leu Leu Asp Leu Thr Ser Tyr Ala Tyr Glu Thr He Ser Val Ala He 
50 55 60 



Asp Val Thr Asn Val Tyr val Val Ala Tyr Arg Thr Arg Asp Val Ser 
65 70 75 30 



Tyr Phe Phe Lys Glu Ser Pro Pro Glu Ala Tyr Asn He Leu Phe Lys 
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85 90 95 



Gly ^Thr Arg Lys lie Thr Leu Pro Tyr Thr Gly Aan Tyr Glu Asn Leu 
100 105 no 



Gin Thr Ala Ala His Lys He Arg Glu Asn He Asp Leu Gly Leu Pro 
115 120 125 



Ala Leu Ser Ser Ala He Thr Thr Leu Phe Tyr Tyr Asn Ala Gin Ser 
130 135 140 



Ala Pro Ser Ala Leu Leu Val Leu He Gin Thr Thr Ala Glu Ala Ala 
145 150 155 iso 



Arg Phe Lys Tyr He Glu Arg His Val Ala Lys Tyr Val Ala Thr Asn 
165 170 175 



Phe Lys Pro Asn Leu Ala He He Ser Leu Glu Asn Gin Trp Ser Ala 
180 185 150 



Leu Ser Lys Gin He Phe Leu Ala Gin Asn Gin Gly Gly Lys Phe Arg 
195 200 205 



Asn Pro Val Asp Leu He Lys Pro Thr Gly Glu Arg Phe Gin Val Thr 
210 215 220 



Asn Val Asp Ser Asp Val Val Lys Gly Asn He Lys Leu Leu Leu Asn 
225 - 230 235 240 



Ser Arg Ala Ser Thr Ala Asp Glu Asn Phe He Thr Thr Met Thr Leu 
245 250 255 



Leu Gly Glu Ser Val Val Asn 
260 

INFORMATION FOR SEQ ID NO: 5: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 248 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Asp Val Arg Phe Ser Leu Ser Gly Ser Ser Ser Thr Ser Tyr Ser Lys 
15 10 is 



Phe lie Gly Asp Leu Arg Lys Ala Leu Pro Ser Asn Gly Thr Val Tyr 

" ( ' 20 25 30 



Asn Leu Thr lie Leu Leu Ser Ser Ala Ser Gly Ala Ser Arg Tyr Thr 
35 40 45 



Leu Met Thr Leu Ser Asn Tyr Asp Gly Lys Ala lie Thr Val Ala Val 
50 55 60 



Asp Val Ser Gin Leu Tyr lie Met Gly Tyr Leu Val Asn Ser Thr Ser 

65 70 75 80 



Tyr Phe Phe Asn Glu Ser Asp Ala Lys Leu Ala Ser Gin Tyr Val Phe 
35 90 95 



Lys Gly Ser Thr He Val Thr Leu Pro Tyr Ser Gly Asn Tyr Glu Lys 
100 105 110 



Leu Gin Thr Ala Ala Gly Lys He Arg Glu Lys He Pro Leu Gly Phe 
115 120 125 



Pro Ala Leu Asp Ser Ala Leu Thr Thr He Phe His Tyr Asp Ser Thr 
130 135 140 



Ala Ala Ala Ala Ala Phe Leu Val He Leu Gin Thr Thr Ala Glu Ala 
145 150 155 160 



Ser Arg Phe Lys Tyr He Glu Gly Gin He He Glu Arg He Ser Lys 
165 170 175 



Asn Gin Val Pro Ser Leu Ala Thr He Ser Leu Glu Asn Ser Leu Trp 
180 185 190 
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Ser Ala Leu Ser Lys Gin He Gin Leu Ala Gin Thr Asn Asn Gly Thr 
195 200 205 



Phe Lys Thr Pro Val Val He Thr Asp Asp Lys Gin Gin Arg Val Glu 
210 215 220 



He Thr Asn Val Thr Ser Lys Val Val Thr Lys Asn He Gin Leu Leu 
225 230 235 240 



Leu Asn Tyr Lys Gin Asn Val Ala 
245 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 247 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

Asp Val Ser Phe Arg Leu Ser Gly Ala Thr Ser Ser Ser Tyr Gly Val 
15 10 15 



Phe He Ser Asn Leu Arg Lys Ala Leu Pro Asn Glu Arg Lys Leu Tyr 
20 25 30 



Asp He, Pro Leu Leu Arg Ser Ser Leu Pro Gly Ser Gin Arg Tyr Ala 
35 40 45 



Leu He His Leu Thr Asn Tyr Ala Asp Glu Thr He Ser Val Ala He 
50 55 60 



Asp Val Thr Asn Val Tyr He Met Gly Tyr Arg Ala Gly Asp Thr Ser 
65 70 75 80 



Tyr Phe Phe Asn Glu Ala Ser Ala Thr Glu Ala Ala Lys Tyr Val Phe 
85 90 95 



Lys Asp Ala Met Arg Lys Val Thr Leu Pro Tyr Ser Gly Asn Tyr Glu 
100 105 110 
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Arg Leu Gin Thr Ala Ala Gly Lys lie Arg Glu Asn He Pro Leu Glv 
US 120 125 



Leu Pro Ala Leu Asp Ser Ala He Thr Thr Leu Phe Tyr Tyr Asn Ala 
13 ° 135 140 



Asn Ser Ala Ala Ser Ala Leu Met Val Leu He Gin Ser Thr Ser Glu 

145 !50 155 160 

Ala Ala Arg Tyr Lys Phe He Glu Gin Gin He Gly Lys Arg Val Asp 

165 170 175 



Lys Thr Phe Leu Pro Ser Leu Ala He He Ser Leu Glu Asn Ser Trp 
180 185 190 



Ser Ala Leu Ser Lys Gin He Gin He Ala Ser Thr Asn Asn Gly Gin 
195 200 205 



Phe Glu Ser Pro Val Val Leu He Asn Ala Gin Asn Gin Val Ala Thr 
210 215 220 



He Thr Asn Val Asp Ala Gly Val Val Thr Ser Asn He Ala Leu Leu 
225 230 235 240 



Leu Asn Arg Asn Asn Met Ala 
245 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 263 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Asp Val Ser Phe Arg Leu Ser Gly Ala Asp Pro Arg Ser Tyr Gly Met 
15 10 15 



Phe He Lys Asp Leu Arg Asn Ala Leu Pro Phe Arg Glu Lys Val Tyr 
20 25 30 



# 
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Asn lie Pro Leu Leu Leu Pro Ser Val Ser Gly Ala Gly Arg Tyr Leu 
35 40 45 



Leu Met His Leu Phe Asn Tyr Asp Gly Lys Thr He Thr Val Ala Val 
50 55 60 



Asp Val Thr Asn Val Tyr He Met Gly Tyr Leu Ala Asp Thr Thr Ser 

70 75 80 



Tyr Phe Phe Asn Glu Pro Ala Ala Glu Leu Ala Ser Gin Tyr Val Phe 
85 90 95 



Arg Asp Ala Arg Arg Lys He Thr Leu Pro Tyr Ser Gly Asn Tyr Glu 
100 105 110 



Arg Leu Gin He Ala Ala Gly Lys Pro Arg Glu Lys He Pro He Gly 
115 120 125 



Leu Pro Ala Leu Asp Ser Ala He Ser Thr Leu Leu His Tyr Asp Ser 
130 135 140 



Thr Ala Ala Ala Gly Ala Leu Leu Val Leu He Gin Thr Thr Ala Glu 
145 150 155 160 



Ala Ala Arg Phe Lys Tyr He Glu Gin Gin He Gin Glu Arg Ala Tyr 
165 170 175 



Arg Asp Glu Val Pro Ser Leu Ala Thr He Ser Leu Glu Asn Ser Trp 
180 185 190 



Ser Gly Leu Ser Lys Gin He Gin Leu Ala Gin Gly Asn Asn Gly He 
195 200 205 



Phe Arg Thr Pro He Val Leu Val Asp Asn Lys Gly Asn Arg Val Gin 
210 215 220 



He Thr Asn Val Thr Ser Lys Val Val Thr Ser Asn He Gin Leu Leu 
225 230 235 240 



Leu Asn Thr Arg Asn He Ala Glu Gly Asp Asn Gly Asp Val Ser Thr 
245 250 255 
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Thr His Gly Phe Ser Ser Thr 
260 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 250 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Ala Pro Thr Leu Glu Thr He Ala Ser Leu Asp Leu Asn Asn Pro Thr 
15 10 is 

Thr Tyr Leu Ser Phe He Thr Asn lie Arg Thr Lys Val Ala Asp Lys 
20 25 30 



Thr Glu Gin Cys Thr He Gin Lys He Ser Lys Thr Phe Thr Gin Arg 
35 40 45 



Tyr Ser Tyr He Asp Leu He Val Ser Ser Thr Gin Lys He Thr Leu 
50 55 60 



Ala He Asp Met Ala Asp Leu Tyr Val Leu Gly Tyr Ser Asp He Ale 
65 70 75 80 



Asn Asn Lys Gly Arg Ala Phe Phe Phe Lys Asp Val Thr Glu Ala Val 

85 90 55 



Ala Asn Asn Phe Phe Pro Gly Ala Thr Gly Thr Asn Arg He Lys Leu 
100 105 no 



Thr Phe Thr Gly Ser Tyr Gly Asp Leu Glu Lys Asn Gly Gly Leu Arg 
115 120 125 



Lys Asp Asn Pro Leu Gly He Phe Arg Leu Glu Asn Ser He Val Asn 
130 135 140 



He Tyr Gly Lys Ala Gly Asp Val Lys Lys Gin Ala Lys Phe Phe Leu 
145 150 155 160 
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Leu Ala He Gin Met Val Ser Glu Ala Ala Arg Phe Lys Tyr He Ser 
1^5 170 x75 



Asp^Lys He Pro Ser Glu Lys Tyr Glu Glu Val Thr Val Asp Glu Tyr 
18 ° 185 X90 



Met Thr Ala Leu Glu Asn Asn Trp Ala Lys Leu Ser Thr Ala Val Tyr 
1^5 200 205 



Asn- Ser Lys Pro Ser Thr- Th* Thr Ala Thr Lys Cys Gin Leu Ala Thr 
210 215 220 



Ser Pro Val Thr He Ser- Pro Trp He Phe Lys Thr Val Glu Glu He 
225 230 235 240 



Lys Leu Val Met Gly Leu Leu Lys Ser Ser 
245 250 

INFORMATION FOR SEQ ID NO : 9 : 

<i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 261 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

He Asn Thr He Thr Phe Asp Ala Gly Asn Ala Thr He Asn Lys Tyr 
1 5 10 15 



Ala Thr Phe Met Glu Ser Leu Arg Asn Glu Ala Lys Asp Pro Ser Leu 
20 25 30 



Lys Cys. Tyr Gly He Pro Met Leu Pro Asn Thr Asn Ser Thr He Lys 
35 40 45 



Tyr Leu Leu Val Lys Leu Gin Gly Ala Ser Leu Lys Thr He Thr Leu 
50 55 60 



Met Leu Arg Arg Asn Asn Leu Tyr Val Met Gly Tyr Ser Asp Pro Tyr 
65 70 75 80 



Asp Asn Lys Cys Arg Tyr His He Phe Asn Asp He Lys Gly Thr Glu 



# 
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85 90 95 



Tyr Ser Asp Val Glu Asn Thr Leu Cys Pro Ser Ser Asn Pro Arg Val 
100 ' 105 110 



Ala Lys Pro lie Asn Tyr Asn Gly Leu Tyr Pro Thr Leu Glu Lys Lys 
115 120 125 



Ala Gly Val Thr Ser Arg Asn Glu Val Gin Leu Gly lie Gin lie Leu 

•* ' 130 " - 135 140 



Ser Ser Asp lie Gly Lys lie Ser Gly Gin Gly Ser Phe Thr Glu Lys 
145 150 155 160 



He Glu Ala Asp Phe Leu Leu Val Ala He Gin Met Val Ser Glu Ala 
165 170 175 



Ala Arg Phe Lys Tyr He Glu Asn Gin Val Lys Thr Asn Phe Asn Arg 
180 185 190 



Asp Phe Ser Pro Asn Asp Lys Val Leu Asp Leu Glu Glu Asn Trp Gly 
195 200 205 



Lys He Ser Thr Ala He His Asn Ser Lys Asn Gly Ala Leu Pro Lys 
210 215 220 



Pro Leu Glu Leu Lys Asn Ala Asp Gly Thr Lys Trp He Val Leu Arg 
225 230 235 240 



Val Asp Glu He Lys Pro Asp Val Gly Leu Leu Asn Tyr Val Asn Gly 
245 250 255 



Thr Cys Gin Ala Thr 
260 

INFORMATION FOR SEQ ID NO: 10: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 259 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



« 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 

Val Thr Ser He Thr Leu Asp Leu Val Asn Pro Thr Ala Gly Gin Tyr 
15 10 15 



Ser Ser Phe Val Asp Lys He Arg Asn Asn Val Lys Asp Pro Asn Leu 
20 25 30 



Lys Tyr Gly Gly Thr Asp He Ala Val He Gly Pro Pro Ser Lys Glu 
35 40 45 



Lys Phe Leu Arg He Asn Phe Gin Ser Ser Arg Gly Thr Val Ser Leu 
50 55 60 



Gly Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asp 
65 70 75 80 



Asn Thr Asn Val Asn Arg Ala Tyr Tyr Phe Arg Ser Glu He Thr Ser 
85 90 95 



Ala Glu Ser Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gin Lys 
100 105 110 



Ala Leu Glu Tyr Thr Glu Asp Tyr Gin Ser He Glu Lys Asn Ala Gin 
115 120 125 



He Thr Gin Gly Asp Gin Ser Arg Lys Glu Leu Gly Leu Gly He Asp 
130 135 140 



Leu Leu Ser Thr Ser Met Glu Ala Val Asn Lys Lys Ala Arg Val Val 
145 150 155 160 



Lys Asp Glu Ala Arg Phe Leu Leu He Ala He Gin Met Thr Ala Glu 
165 170 175 



Ala Ala Arg Phe Arg Tyr He Gin Asn Leu Val He Lys Asn Phe Pro 
180 185 190 



Asn Lys Phe Asn Ser Glu Asn Lys Val He Gin Phe Glu Val Asn Trp 
195 200 205 



Lys Lys He Ser Thr Ala He Tyr Gly Asp Ala Lys Asn Gly Val Phe 



# 
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210 215 220 

Asn Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gin Val Lys Asp 
225 ^ 230 235 240 

Leu Gin Met Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys Ser Ser Asn 
245 250 255 

Glu Ala Asn 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GGGCTAGATA CCGTGTCATT CTCAACCAAA GGTGC CACTT ATATTACCTA CGTGAATTTC 60 

TTGAATGAGC TACGAGTTAA ATTGAAACCC GAAGGTAACA GCCATGGAAT CCCATTGCTG 120 

CGCAAAAAAT GTGATGATCC TGGAAAGTGT TTCGTTTTGG TAGCGCTTTC AAATGACAAT 180 

GGACAGTTGG CGGAAATAGC TATAGATGTT ACAAGTGTTT ATGTGGTGGG CTATCAAGTA 240 

AGAAACAGAT CTTACTTCTT TAAAGATGCT CCAGATGCTG CTTACGAAGG CCTCTTCAAA 3 00 

AACACAATTA AAACAAGACT TCATTTTGGC GGCAGCTATC CCTCGCTGGA AGGTGAGAAG 360 

GCATATAGAG AGACAACAGA CTTGGGCATT GAACCATTAA GGATTGGCAT CAAGAAACTT 420 

GATGAAAATG CGATAGACAA TT AT AAAC CA ACGGAGATAG CTAGTTCTCT ATTGGTTGTT 480 

ATTCAAATGG TGTCTGAAGC AGCTCGATTC ACCTTTATTG AGAACCAAAT TAGAAATAAC 540 

TTTCAACAGA GAATTCGCCC GGCGAATAAT ACAATCAGCC TTGAGAATAA ATGGGGTAAA 600 

CTCTCGTTCC AGATCCGGAC ATCAGGTGCA AATGGAATGT TTTCGGAGGC AGTTGAATTG 660 

GAACGTGCAA ATGGCAAAAA ATACTATGTC ACCGCAGTTG ATCAAGTAAA ACCCAAAATA 720 

GCACTCTTGA AGTTCGTCGA TAAAGATCCT AAAACGAGCC TTGCTGCTGA ATTGATAATC 780 

CAGAACTATG AGTCATTAGT GGGCTTTGAT TAG 813 
(2) INFORMATION FOR SEQ ID NO: 12: 



# 
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(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 846 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 

- (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 



ATGGCGGCAA 


AGATGGCGAA 


GAACGTGGAC 


AAGCCGCTCT 


TCACCGCGAC 


GTTCAACGTC 


60 


CAGGC CAGCT 


CCGCCGACTA 


CGCCACCTTC 


ATCGCCGGCA 


TCCGCAACAA 


GCTCCGCAAC 


120 


CCGGCGCACT 


TCTCCCACAA 


CCGCCCCGTG 


CTGCCGCCGG 


TCGAGCCCAA 


CGTCCCGCCG 


180 


AGCAGGTGGT 


TCCACGTCGT 


GCTCAAGGCC 


TCGCCGACCA 


GCGCCGGGCT 


CACGCTGGCC 


240 


ATCCGCGCGG 


ACAACATCTA 


CCTGGAGGGC 


TTCAAGAGCA 


GCGACGGCAC 


CTGGTGGGAG 


300 


CTCACCCCGG 


GCCTCATCCC 


CGGCGCCACC 


TACGTCGGGT 


TCGGCGGCAC 


CTACCGCGAC 


360 


CTCCTCGGCG 


ACACCGACAA 


GCTAACCAAC 


GTCGCTCTCG 


GCCGACAGCA 


GCTGGCGGAC 


420 


GCGGTGACCG 


CGCTCCACGG 


GCGCACCAAG 


GCCGACAAGG 


CCTCCGGCCC 


GAAGCAGCAG 


430 


CAGGCGAGGG 


AGGCGGTGAC 


GACGCTGGTC 


CTCATGGTGA ACGAGGCCAC 


GCGGTTCCAG 


540 


ACGGTGTCTG 


GGTTCGTGGC 


CGGGTTGCTG 


CACCCCAAGG 


CGGTGGAGAA 


GAAGAGCGGG 


600 


AAGATCGGCA 


ATGAGATGAA 


GGCCCAGGTG 


AACGGGTGGC 


AGGACCTGTC 


CGCGGCGCTG 


660 


CTGAAGACGG 


ACGTGAAGCC 


TCCGCCGGGA 


AAGTCGCCAG 


CGAAGTTCGC 


GCCGATCGAG 


720 


AAGATGGGCG 


TGAGGACGGC 


TGAACAGGCC 


GCCAACACGC 


TGGGGATCCT 


GCTGTTCGTG 


780 


GAGGTGCCGG 


GTGGGTTGAC 


GGTGGCCAAG 


GCGCTGGAGC 


TGTTCCATGC 


GAGTGGTGGG 


340 


AAATAG 












846 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 913 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



CGTCCGAAAA TGGTGAAATG CTTACTACTT TCTTTTTTAA TTATCGCCAT CTTCATTGGT 



GTTCCTACTG CCAAAGGCGA TGTTAACTTC GATTTGTCGA CTGCCACTGC AAAAACCTAC 



60 
120 
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ACAAAATTTA 


TCGAAGATTT 


CAGGGCGACT 


CTTCCATTTA 


GCCATAAAGT 


GTATGATATA 


180 


CCTCTACTGT 


ATTCCACTAT 


TTCCGACTCC 


AGACGTTTCA 


TACTCCTCGA 


TCTTACAAGT 


240 


TATGCATATG 


AAACCATCTC 


GGTGGCCATA 


GATGTGACGA 


ACGTTTATGT 


TGTGGCGTAT 


300 


CGCACCCGCG 


ATGTATCCTA 


CTTTTTTAAA 


GAATCTCCTC 


CTGAAGCTTA 


TAACATCCTA 


360 


TTCAAAGGTA 


CGCGGAAAAT 


TACACTGCCA 


TATACCGGTA 


ATTATGAAAA 


TCTTCAAACT 


420 


GCTGCACACA 


AAATAAGAGA 


GAATATTGAT 


CTTGGACTCC 


CTGCCTTGAG 


TAGTGCCATT 


480 


ACCACATTGT 


TTTATTACAA 


TGCCCAATCT 


GCTCCTTCTG 


CATTGCTTGT 


ACTAATCCAG 


540 


ACGACTGCAG 


AAGCTGCAAG 


ATTTAAGTAT 


ATCGAGCGAC 


ACGTTGCTAA 


GTATGTTGCC 


600 


ACTAACTTTA 


AGCCAAATCT 


AGCCATCATA 


AGCTTGGAAA 


ATCAATGGTC 


TGCTCTCTCC 


660 


AACAAATCTT 


TTTGGCGCAG 


AATCAAGGAG 


GAAAATTTAG 


AAATCCTGTC 


GACCTTATAA 


720 


AAC CTACCGG 


GGAACGGTTT 


CAAGTAACCA 


ATGTTGATTC 


AGATGTTGTA 


AAAGGTAATA 


780 


TCAAACTCCT 


GCTGAACTCC 


AGAGCTAGCA 


CTGCTGATGA 


AAACTTTATC 


ACAACCATGA 


840 


CTCTACTTGG 


GGAATCTGTT 


GTGAATTGAA 


AGTTTAATAA 


TCCACCCATA 


TCGAAATAAG 


900 


G CATGTT CAT 


GAC 










9X3 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
TTYAARGAYG CNCCNGAYGC NGCNTAYGAR GG 32 
(2) INFORMATXON FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 
ACYTGRTCNA CNGCNGTNAC RTARTAYTTY TT 32 




-163- 

(2} INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi> SEQUENCE DESCRIPTION: SEQ ID NO; IS: 
GGNYTNGAYA CNGTNWSNTT YWSNACNAAR GG 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

AATGGTTCAA TGCCCAAGTC TGT 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 23 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 
TGTCTCTCTA TATGCCTTCT CAC 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 




32 



23 



TCAACCCGGG CTAGATACCG TGTCATTCTC AACCAAAGGT GCCACTTATA TTA 



53 
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(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

- (C) STRAND ED NESS : single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CTTCATTTTG GCGGCACGTA TCC 23 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21; 
CTCGAGGCTG CAAGCTTACG TGGGATTTTT TmTTTTTT TTTTTT 46 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CTCGCTGGAA GGTGAGAA 13 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE; nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23; 



CTCGAGGCTG CAAGCTTACG TGGGA 



25 
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(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 5 base pairs 
_ (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 
TGATCTCGAG TACTATTTAG GATCTTTATC GACGA 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25 
GTAAGCAGCA TCTGGAGCAT CT 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 
CATTCAAGAA ATTCACGTAG G 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27 
GGCCTGGACA CCGTGAGCTT TAG 
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(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 base pairs 
-(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 
TCGATTGCGA TCCTAAATAG TACTC 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 
TTTAGGATCG CAATCGACGA ACTTCAAG 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 
GTTCGTCTGT AAAGATCCTA AATAGTACTC GA 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 
GGATCTTTAC AGACGAACTT CAAGAGT 
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(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 base pairs 
- (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 
TCTTGTGCTT CGTCGATAAA GATCC 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 
ATCGACGAAG CACAAGAGTG CTATTTT 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 
GTAAAACCAT GCATAGCACT CTTGAAGTTC GT 
(2) INFORMATION FOR SEQ ID NO:35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 
AGTGCTATGC ATGGTTTTAC TTGATCAACT GC 
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(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 29 base pairs 
"(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36 

AGCACATGTG GTGCCACTTA TATTACCTA 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 33 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 
TAAGTGGCAC CACATGTGCT AAAGCTCACG GTG 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38 
TGACTGTGGA CAGTTGGCGG AAATA 
(2) INFORMATION* FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39 
GCCAACTGTC CACAGTCATT TGAAAGCGCT ACC 
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(2) INFORMATION FOR SEQ ID NO: 40; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

Cii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:4C 
GATGATCCTG GAAAGGCTTT CGTTTTGGTA GCGCTT 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41 
AAGCCTTTCC AGGATCATCA GCTTTTTTGC GCAGCAATGG 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 
AAGCCTTTCC AGGATCATCA CAT 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 



GCGACTCTCT ACTGTTTC 
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(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44 
CGTTAGCAAT TTAACTGTGA T 
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45 
AACAGCTATG ACCATG 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 
TGAACTCGAG GAAAACTACC TATTTCCCAC 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
GCATTACATC CATGGCGGC 
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(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 64 base pairs 
- (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

Cii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 
GATATCTCGA GTTAACTATT TCCCACCACA CGCATGGAAC AGCTCCAGCG CCTTGGCCAC 60 
CGTC 64 
(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 

TGTCTGTTCG TGGAGGTGCC G 21 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
CCAAGTGTCT GGAGCTGTTC CATGCGA 27 
(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
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GATGTTAAYT TYGAYTTGTC NACDGCTAC 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) -SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 9 base pairs 
<B) TYPE : nucleic acid 
(C) STRANDEDNESS : single 
<D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52 
ATTGGNAGDG TAGCCCTRAA RTCYTCDAT 
(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 
GCCACTGCAA AAACCTACAC AAAATTTATT GA 
(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 
GATGTTAACT TCGATTTGTC GA 
(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55 
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TCAACTCGAG GTACTCAATT CACAACAGAT TCC 
(2) INFORMATION FOR SEQ ID NO: 56: 

(i) ^SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Cys His His His Ala Ser Arg Val Ala Arg Met Ala Ser Asp Glu Phe 
15 10 15 



Pro Ser Met Cys 
20 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Pro Ser Gly Gin Ala Gly Ala Ala Ala Ser Glu Ser Leu Phe lie Ser 
15 10 15 



Asn His Ala Tyr 
20 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
CAGCCATGGA ATCCCATTGC TG 
(2) INFORMATION FOR SEQ ID NO: 59: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:59: 
CACATGTAAA ACAAGACTTC ATTTTGGC 
(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
TGAAGTCTTG TTTTAGATGT GTTTTTGAAG AGGCCT 
(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
ATGCCATATG CAATTATAAA CCAACGGAGA 
(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
GGTTTATAAT TGCATATGGC ATTTTCATCA AGTTTCTTG 
(2) INFORMATION FOR SEQ ID NO: 63: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
_ (D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: DNA 

(XX ) SEQUENCE DESCRIPTION: SEQ ID NO: 63 
CTTTCAACAA TGCATTCGCC CGGCGAATAA TAC 
(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 
GCGAATGCAT TGTTGAAAGT TATTTCTAAT TTG 
(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65 

GTTTTGTGAG GCAGTTGAAT TGGAAC 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 34 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66 
TTCAACTGCC TCACAAAACA TTCCATTTGC ACCT 
(2) INFORMATION FOR SEQ ID NO: 67: 



-176- 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
- {D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
AAAAGCTGAT GATCCTGGAA AGTG 24 
(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:68: 
TCCAGGATCA TCAGCTTTTT TGCGCAGCAA TGGGA 35 
(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

GACATCCAGA TGACTCAGTC TCCATCTTCC ATGTCTGCAT CTCTGGGAGA CAGAGTCACT 60 

ATCACTTGCC GGGCGAGTCA GGACATTAAT AGCTATTTAA GCTGGTTCCA GCAGAAACCA 120 

GGGAAATCTC CTAAGACCCT GATCTATCGT GCAAACAGAT TGGTAGATGG GGTCCCATCA 180 

AGGTTCAGTG GCAGTGGATC TGGGACAGAT TATACTCTCA CCATCAGCAG CCTGCAATAT 240 

GAAGATTTTG GAATTTATTA TTGTCAACAG TATGATGAGT CTCCGTGGAC GTTCGGTGGA 3 00 

GGCACCAAGC TTGAAATCAA A 321 
(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 354 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) "SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

CAGATCCAGT TGGTGCAGTC TGGACCTGGC CTGAAGAAGC CTGGAGGGTC CGTCAGAATC 60 

TCCTGCGCAG CTTCTGGGTA TACCTTCACA AACTATGGAA TGAACTGGGT GAAGCAGGCT 120 

CCAGGAAAGG GTTTAAGGTG GATGGGCTGG ATAAACACCC ACACTGGAGA GCCAACATAT 180 

GCTGATGACT TCAAGGGACG GTTTACCTTC TCTTTGGACA CGTCTAAGAG CACTGCCTAT 240 

TTACAGATCA ACAGCCTCAG AGCCGAGGAC ACGGCTACAT ATTTCTGTAC AAGACGGGGT 300 

TACGACTGGT ACTTCGATGT CTGGGGCCAA GGGACCACGG TCACCGTCTC CTCC 354 
(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

GAGATC CAGT TGGTGCAGTC TGGAGGAGGC CTGGTGAAGC CTGGAGGGTC CGTCAGAATC 60 

TCCTGCGCAG CTTCTGGGTA TACCTTCACA AACTATGGAA TGAACTGGGT GCGCCAGGCT 120 

CCAGGAAAGG GTTTAGAGTG GATGGGCTGG ATAAACACCC ACACTGGAGA GCCAACATAT 180 

GCTGATTCTT TCAAGGGACG GTTTACCTTC TCTTTGGACG ATTCTAAGAA CACTGCCTAT 240 

TTACAGATCA ACAGCCTCAG AGCCGAGGAC ACGGCTGTGT ATTTCTGTAC AAGACGGGGT 3 00 

TACGACTGGT ACTTCGATGT CTGGGGCCAA GGGACCACGG TCACCGTCTC CTCC 354 
(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
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GACATCCAGA TGACTCAGTC TCCATCTTCC CTGTCTGCAT CTGTAGGAGA CAGAGTCACT 60 

ATCACTTGCC GGGCGAGTCA GGACATTAAT AGCTATTTAA GCTGGTTCCA GCAGAAACCA 120 

GGGAAAGCTC CTAAGACCCT GATCTATCGT GCAAACAGAT TGGAATCTGG GGTCCCATCA 180 

AGGTTCAGTG GCAGTGGATC TGGGACAGAT TATACTCTCA CCATCAGCAG CCTGCAATAT 240 

GAAGATTTTG GAATTTATTA TTGTCAACAG TATGATGAGT CTCCGTGGAC GTTCGGTGGA 3 00 

GGCACCAAGC TTGAAATCAA A 321 
(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:73: 
TGTCATCATC ATGCATCGCG AGTTGCCAGA ATGGCATCTG ATGAGTTTCC TTCTATGTGC 60 
GCAAGTACTC 70 
(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
TCGAGAGTAC TTGCGCACAT AGAAGGAAAC TCATCAGATG CCATTCTGGC AACTCGCGAT 60 
GCATGATGAT GACATGCA 78 
(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
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TGTTCGGCCG CATGTCATCA TCATGCATCG 
(2) INFORMATION FOR SEQ ID NO: 76: 

(i) -SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:76: 
AGTCATGCCC CGCGC 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
TCCCGGCTGT CCTACAGT 
(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78 
TCCAGCCTGT CCAGATGGTG TGTGAGTTTT GTCACAA 
(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79 
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CTAACTCGAG AGTACTGTAT GCATGGTTCG AGATGAACAA AGATTCTGAG GCTGCAGCTC 60 

76 



CAGCCTGTCC AGATGG 
(2) INFORMATION FOR SEQ ID NO: 80: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 0 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
CTAACTCGAG AGTACTGTAT 20 
(2) INFORMATION FOR SEQ ID NO: 81: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
TCCAGCCTGT CCAGATGGAC ACTCTCCCCT GTTGAA 36 
(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 
GTACAGTGGA AGGTGGAT 18 
(2) INFORMATION FOR SEQ ID NO: 83: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 
CATGCGGCCG ATTTAGGATC TTTATCGACG A 
(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84 
AACATCCAGT TGGTGCAGTC TG 
(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85 
GAGGAGACGG TGACCGTGGT 
(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8€ 
GACATCAAGA TGACCCAGT 
(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 87: 
GTTTGATTTC AAGCTTGGTG C 
(2) INFORMATION FOR SEQ ID NO: 88: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 
ACTTCGGCCG CACCATCTGG ACAGGCTGGA G 31 
(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS ? > 

(A) LENGTH: 723 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

GACATCCAGA TGACTCAGTC TCCATCTTCC CTGTCTGCAT CTGTAGGAGA CAGAGTCACT 60 

ATCACTTGCC GGGCGAGTCA GGACATTAAT AGCTATTTAA GCTGGTTCCA GCAGAAACCA 120 

GGGAAAGCTC CTAAGACCCT GATCTATCGT GCAAACAGAT TGGAATCTGG GGTCCCATCA 180 

AGGTTCAGTG GCAGTGGATC TGGGACAGAT TATACTCTCA CCATCAGCAG CCTGCAATAT 240 

GAAGATTTTG GAATTTATTA TTGTCAACAG TATGATGAGT CTCCGTGGAC GTTCGGTGGA 300 

GGCACCAAGC TTGAGATGAA AGGTGGCGGT. GGATCTGGTG GAGGTGGGTC CGGAGGTGGA 360 

GGATCTGAGA TCCAGTTGGT GCAGTCTGGA GGAGGCCTGG TGAAGCCTGG AGGGTCCGTC 420 

AGAATCTCCT GCGCAGCTTC TGGGTATACC TTCACAAACT ATGGAATGAA CTGGGTGCGC 480 

CAGGCTCCAG GAAAGGGTTT AGAGTGGATG GGCTGGATAA ACACCCACAC TGGAGAGCCA 540 

ACATATGCTG ATTCTTTCAA GGGACGGTTT ACCTTCTCTT TGGACGATTC TAAGAACACT 600 

GCCTATTTAC AGATCAACAG CCTCAGAGCC GAGGACACGG CTGTGTATTT CTGTACAAGA 660 

CGGGGTTACG ACTGGTACTT CGATGTCTGG GGCCAAGGGA CCACGGTCAC CGTCTCCTCA 720 

TGA 723 
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(2) INFORMATION FOR SEQ ID NO: 90: 

Ci) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 723 base pairs 
- (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

GAGATCCAGT TGGTGCAGTC TGGAGGAGGC CTGGTGAAGC CTGGAGGGTC CGTCAGAATC 60 

TCCTGCGCAG CTTCTGGGTA TACCTTCACA AACTATGGAA TGAACTGGGT GCGCCAGGCT 120 

CCAGGAAAGG GTTTAGAGTG GATGGGCTGG ATAAACACCC ACACTGGAGA GCCAACATAT 180 

GCTGATTCTT TCAAGGGACG GTTTACCTTC TCTTTGGACG ATTCTAAGAA CACTGCCTAT 240 

TTACAGATCA ACAGCCTCAG AGCCGAGGAC ACGGCTGTGT ATTTCTGTAC AAGACGGGGT 3 00 

TACGACTGGT ACTTCGATGT CTGGGGCCAA GGGACCACGG TCACCGTCTC CTCAGGTGGC 360 

GGTGGATCTG GTGGAGGTGG GTCCGGAGGT GGAGGATCTG ACATCCAGAT GACTCAGTCT 420 

CCATCTTCCC TGTCTGCATC TGTAGGAGAC AGAGTCACTA TCACTTGCCG GGCGAGTCAG 480 

GACATTAATA GCTATTTAAG CTGGTTCCAG CAGAAACCAG GGAAAGCTCC TAAGACCCTG 540 

ATCTATCGTG CAAACAGATT GGAATCTGGG GTCCCATCAA GGTTCAGTGG CAGTGGATCT 600 

GGGACAGATT ATACTCTCAC CATCAGCAGC CTGCAATATG AAGATTTTGG AATTTATTAT 660 

TGTCAACAGT ATGATGAGTC TCCGTGGACG TTCGGTGGAG GCACCAAGCT TGAGATGAAA 720 

TGA 723 
(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) . TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 
CGGACCCACC TCCACCAGAT CCACCGCCAC CTTTCATCTC AAGCTTGGTG C 51 
(2) INFORMATION FOR SEQ ID NO: 92: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 
GACATCCAGA TGACTCAGT 19 
(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
GGTGGAGGTG GGTCCGGAGG TGGAGGATCT GAGATCCAGT TGGTGCAGT 4 9 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
TGTACTCGAG CCCATCATGA GGAGACGGTG ACCGT 35 
(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:95: 
GGTGGAGGTG GGTCCGGAGG TGGAGGATCT GACATCCAGA TGACTCAGT 49 



(2) INFORMATION FOR SEQ ID NO: 96: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
-(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

{xi} SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
TGTACTCGAG CCCATCATTT CATCTCAAGC TTGGTGC 37 
(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 
GAGATCCAGT TGGTGCAGTC TG 22 
(2) INFORMATION FOR SEQ ID NO : 98 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
CGGACCCACC TCCACCAGAT CCACCGCCAC CTGAGGAGAC GGTGACCGT 49 
(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 



Gly Leu Asp Thr Val Ser Phe Ser Thr Lys Gly Ala Thr Tyr lie Thr 
15 10 15 
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Tyr Val Asn Phe Leu Asn Glu Leu Arg Val Lys Leu Lys Pro Glu Gly 
20 25 30 



Asn Ser His Gly lie Pro Leu Leu Arg Lys Lys Cys Asp Asp Pro Gly 
35 40 45 



Lys Ala Phe Val Leu Val Ala Leu Ser Asn Asp Asn Gly Gin Leu Ala 
50 55 60 



Glu He Ala He Asp Val Thr Ser Val Tyr Val Val Gly Tyr Gin Val 
65 70 75 80 



Arg Asn Arg Ser Tyr Phe Phe Lys Asp Ala Pro Asp Ala Ala Tyr Glu 
85 90 95 



Gly Leu Phe Lys Asn Thr He Lys Thr Arg Leu His Phe Gly Gly Ser 
100 105 110 



Tyr Pro Ser Leu Glu Gly Glu Lys Ala Tyr Arg Glu Thr Thr Asp Leu 
115 120 125 



Gly He Glu Pro Leu Arg He Gly He Lys Lys Leu Asp Glu Asn Ala 
130 135 140 



He Asp Asn Tyr Lys Pro Thr Glu He Ala Ser Ser Leu Leu Val Val 
145 150 155 160 



He Gin Met Val Ser Glu Ala Ala Arg Phe Thr Phe He Glu Asn Gin 
165 170 175 



He Arg Asn Asn Phe Gin Gin Arg He Arg Pro Ala Asn Asn Thr He 
180 185 190 
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Ser Leu Glu Asn Lys Trp Gly Lys Leu Ser Phe Gin He Arg Thr Ser 
195 200 205 



Gly Ala Asn Gly Met Phe Ser Glu Ala Val Glu Leu Glu Arg Ala Asn 
210 215 220 



Gly Lys Lys Tyr Tyr Val Thr Ala Val Asp Gin Val Lys Pro Lys He 
225 230 235 240 



Ala Leu Leu Lys Phe Val Asp Lys Asp Pro Lys 
245 250 

(2) INFORMATION FOR SEQ ID NO: 100: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

Gly Leu Asp Thr Val Ser Phe Ser Thr Lys Gly Ala Thr Tyr He Thr 
15 10 IS 



Tyr Val Asn Phe Leu Asn Glu Leu Arg Val Lys Leu Lys Pro Glu Gly 
20 25 30 



Asn Ser His Gly He Pro Leu Leu Arg Lys Lys Ala Asp Asp Pro Gly 
35 40 45 



Lys Cys Phe Val Leu Val Ala Leu Ser Asn Asp Asn Gly Gin Leu Ala 
50 55 60 



Glu He Ala He Asp Val Thr Ser Val Tyr Val Val Gly Tyr Gin Val 

65 70 75 80 



Arg Asn Arg Ser Tyr Phe Phe Lys Asp Ala Pro Asp Ala Ala Tyr Glu 
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85 90 95 



Gly Leu Phe Lys Asn Thr He Lya Thr Arg Leu His Phe Gly Gly Ser 
100 105 110 



Tyr Pro Ser Leu Glu Gly Glu Lys Ala Tyr Arg Glu Thr Thr Asp Leu 
115 120 125 



Gly He Glu Pro Leu Arg He Gly He Lys Lys Leu Asp Glu Asn Ala 
130 135 140 



lie Asp Asn Tyr Lys Pro Thr Glu He Ala Ser Ser Leu Leu Val Val 
145 150 155 160 



He Gin Met Val Ser Glu Ala Ala Arg Phe Thr Phe He Glu Asn Gin 
165 170 175 



He Arg Asn Asn Phe Gin Gin Arg He Arg Pro Ala Asn Asn Thr lie 
180 185 190 



Ser Leu Glu Asn Lys Trp Gly Lys Leu Ser Phe Gin He Arg Thr Ser 
195 200 205 



Gly Ala Asn Gly Met Phe Ser Glu Ala Val Glu Leu Glu Arg Ala Asn 
210 215 220 



Gly Lys Lys Tyr Tyr Val Thr Ala Val Asp Gin Val Lys Pro Lys He 
225 230 235 240 



Ala Leu Leu Lys Phe Val Asp Lys Asp Pro Lys 
245 250 

INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 251 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

Gly Leu Asp Thr Val Ser Phe Ser Thr Lys Gly Ala Thr Tyr He Thr 
15 10 15 



Tyr Val Asn Phe Leu Asn Glu Leu Arg Val Lys Leu Lys Pro Glu Gly 
20 25 30 



Asn Ser His Gly He Pro Leu Leu Arg Lys Lys Ala Asp Asp Pro Gly 
35 40 45 



Lys Ala Phe Val Leu Val Ala Leu Ser Asn Asp Asn Gly Gin Leu Ala 
50 55 60 



Glu He Ala He Asp Val Thr Ser Val Tyr Val Val Gly Tyr Gin Val 
€5 70 75 80 



Arg Asn Arg Ser Tyr Phe Phe Lys Asp Ala Pro Asp Ala Ala Tyr Glu 
85 90 95 



Gly Leu Phe Lys Asn Thr He Lys Thr Arg Leu His Phe Gly Gly Ser 
100 105 110 



Tyr Pro Ser Leu Glu Gly Glu Lys Ala Tyr Arg Glu Thr Thr Asp Leu 
115 120 125 



Gly He Glu Pro Leu Arg He Gly He Lys Lys Leu Asp Glu Asn Ala 
130 135 140 



He Asp Asn Tyr Lys Pro Thr Glu He Ala Ser Ser Leu Leu Val Val 
145 150 155 160 
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Ile Gin Met Val Ser Glu Ala Ala Arg Phe Thr Phe lie Glu Asn Gin 
165 170 175 



lie Arg Asn Asn Phe Gin Gin Arg lie Arg Pro Ala Asn Asn Thr lie 
180 185 190 



Ser Leu Glu Asn Lys Trp Gly Lys Leu Ser Phe Gin lie Arg Thr Ser 
195 200 205 



Gly Ala Asn Gly Met Phe Ser Glu Ala Val Glu Leu Glu Arg Ala Asn 
210 215 220 



Gly Lys Lys Tyr Tyr Val Thr Ala Val Asp Gin Val Lys Pro Lys lie 
225 230 235 240 



Ala Leu Leu Lys Phe Val Asp Lys Asp Pro Lys 
245 250 

(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

Gly Leu Asp Thr Val Ser Phe Ser Thr Lys Gly Ala Thr Tyr lie Thr 
1 ' 5 10 15 



Tyr Val Asn Phe Leu Asn Glu Leu Arg Val Lys Leu Lys Pro Glu Gly 
20 25 30 



Asn Ser His Gly He Pro Leu Leu Arg Lys Lys Cys Asp Asp Pro Gly 
35 40 45 
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Lys Cys Phe Val Leu Val Ala Leu Ser Asn Asp Asn Gly Gin Leu Ala 
50 55 60 



Glu He Ala He Asp Val Thr Ser Val Tyr Val Val Gly Tyr Gin Val 
65 70 75 80 



Arg Asn Arg Ser Tyr Phe Phe Lys Asp Ala Pro Asp Ala Ala Tyr Glu 
85 90 95 



Gly Leu Phe Lys Asn Thr He Lys Thr Arg Leu His Phe Gly Gly Ser 
100 105 110 



Tyr Pro Ser Leu Glu Gly Glu Lys Ala Tyr Arg Glu Thr Thr Asp Leu 
115 120 125 



Gly He Glu Pro Leu Arg He Gly He Lys Lys Leu Asp Glu Asn Ala 
130 135 140 



He Asp Asn Tyr Lys Pro Thr Glu He Ala Ser Ser Leu Leu Val Val 
145 150 155 160 



He Gin Met Val Ser Glu Ala Ala Arg Phe Thr Phe He Glu Asn Gin 
165 170 175 



He Arg Asn Asn Phe Gin Gin Arg He Arg Pro Ala Asn Asn Thr He 
180 185 190 



Ser Leu Glu Asn Lys Trp Gly Lys Leu Ser Phe Gin He Arg Thr Ser 
195 200 205 



Gly Ala Asn Gly Met Phe Ser Glu Ala Val Glu Leu Glu Arg Ala Asn 
210 215 220 
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Gly Lys Lys Tyr Tyr Val Thr Ala Val Asp Gin Val Lys Pro Lys lie 
225 230 235 240 



Ala Leu Leu Lys Phe Val Cys Lys Asp Pro Lys 
245 250 

(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 51 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:103: 

Gly Leu Asp Thr Val Ser Phe Ser Thr Lys Gly Ala Thr Tyr lie Thr 
15 10 15 



Tyr Val Asn Phe Leu Asn Glu Leu Arg Val Lys Leu Lys Pro Glu Gly 
20 25 30 



Asn Ser His Gly lie Pro Leu Leu Arg Lys Lys Cys Asp Asp Pro Gly 
35 40 45 



Lys Cys Phe Val Leu Val Ala Leu Ser Asn Asp Asn Gly Gin Leu Ala 
50 55 60 



Glu lie Ala He Asp Val Thr Ser Val Tyr Val Val Gly Tyr Gin Val 
65 70 75 80 



Arg Asn Arg Ser Tyr Phe Phe Lys Asp Ala Pro Asp Ala Ala Tyr Glu 
85 90 95 



Gly Leu Phe Lys Asn Thr He Lys Thr Arg Leu His Phe Gly Gly Ser 
100 105 110 



Tyr Pro Ser Leu Glu Gly Glu Lys Ala Tyr Arg Glu Thr Thr Asp Leu 
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115 120 125 



Gly lie Glu Pro Leu Arg He Gly He Lys Lys Leu Asp Glu Asn Ala 
130 135 140 



He Asp Asn Tyr Lys Pro Thr Glu He Ala Ser Ser Leu Leu Val Val 
145 150 155 160 



He Gin Met Val Ser Glu Ala Ala Arg Phe Thr Phe He Glu Asn Gin 
165 170 175 



He Arg Asn Asn Phe Gin Gin Arg He Arg Pro Ala Asn Asn Thr He 
180 185 190 



Ser Leu Glu Asn Lys Trp Gly Lys Leu Ser Phe Gin He Arg Thr Ser 
195 200 205 



Gly Ala Asn Gly Met Phe Ser Glu Ala Val Glu Leu Glu Arg Ala Asn 
2X0 215 220 



Gly Lys Lys Tyr Tyr Val Thr Ala Val Asp Gin Val Lys Pro Lys He 
225 230 235 240 



Ala Leu Leu Lys Phe Val Asp Cys Asp Pro Lys 
245 250 

INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:104: 



Gly Leu Asp Thr Val Ser Phe Ser Thr Lys Gly Ala Thr Tyr He Thr 
15 10 15 
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Tyr Val Asn Phe Leu Asn Glu Leu Arg Val Lys Leu Lys Pro Glu Gly 
20 25 30 



Asn Ser His Gly lie Pro Leu Leu Arg Lys Lys Cys Asp Asp Pro Gly 
35 40 45 



Lys Cys Phe Val Leu Val Ala Leu Ser Asn Asp Asn Gly Gin Leu Ala 
50 55 60 



Glu He Ala He Asp Val Thr Ser Val Tyr Val Val Gly Tyr Gin Val 
65 70 75 80 



Arg Asn Arg Ser Tyr Phe Phe Lys Asp Ala Pro Asp Ala Ala Tyr Glu 
85 90 95 



Gly Leu Phe Lys Asn Thr He Lys Thr Arg Leu His Phe Gly Gly Ser 
100 105 110 



Tyr Pro Ser Leu Glu Gly Glu Lys Ala Tyr Arg Glu Thr Thr Asp Leu 
115 120 125 



Gly He Glu Pro Leu Arg He Gly He Lys Lys Leu Asp Glu Asn Ala 
130 135 140 



He Asp Asn Tyr Lys Pro Thr Glu He Ala Ser Ser Leu Leu Val Val 
145 150 155 160 



He Gin Met Val Ser Glu Ala Ala Arg Phe Thr Phe He Glu Asn Gin 
165 170 175 



He Arg Asn Asn Phe Gin Gin Arg He Arg Pro Ala Asn Asn Thr He 
180 185 190 
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Ser Leu Glu Asn Lys Trp Gly Lys Leu Ser Phe Gin lie Arg Thr Ser 
195 200 205 



Gly Ala Asn Gly Met Phe Ser Glu Ala Val Glu Leu Glu Arg Ala Asn 
210 215 220 



Gly Lys Lys Tyr Tyr Val Thr Ala Val Asp Gin Val Lys Pro Cys lie 
225 230 235 240 



Ala Leu Leu Lys Phe Val Asp Lys Asp Pro Lys 
245 250 

(2) INFORMATION FOR SEQ ID NO: 105: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

Gly Leu Asp Thr Val Ser Phe Ser Thr Lys Gly Ala Thr Tyr lie Thr 
15 10 15 



Tyr Val Asn Phe Leu Asn Glu Leu Arg Val Lys Leu Lys Pro Glu Gly 
20 25 30 



Asn Ser His Gly lie Pro Leu Leu Arg Lys Lys Cys Asp Asp Pro Gly 
35 40 45 



Lys Cys Phe Val Leu Val Ala Leu Ser Asn Asp Asn Gly Gin Leu Ala 
50 55 60 



Glu lie Ala lie Asp Val Thr Ser Val Tyr Val Val Gly Tyr Gin Val 
65 70 75 80 



• 
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Arg Asn Arg Ser Tyr Phe Phe Lys Asp Ala Pro Asp Ala Ala Tyr Glu 
85 90 95 



Gly Leu Phe Lys Asn Thr lie Lys Thr Arg Leu His Phe Gly Gly Ser 
100 105 110 



Tyr Pro Ser Leu Glu Gly Glu Lys Ala Tyr Arg Glu Thr Thr Asp Leu 
115 120 125 



Gly He Glu Pro Leu Arg He Gly He Lys Lys Leu Asp Glu Asn Ala 
130 135 140 



lie Asp Asn Tyr Lys Pro Thr Glu He Ala Ser Ser Leu Leu Val Val 
145 150 155 160 



He Gin Met Val Ser Glu Ala Ala Arg Phe Thr Phe He Glu Asn Gin 
165 170 175 



He Arg Asn Asn Phe Gin Gin Arg He Arg Pro Ala Asn Asn Thr He 
180 185 190 



Ser Leu Glu Asn Lys Trp Gly Lys Leu Ser Phe Gin He Arg Thr Ser 
195 200 205 



Gly Ala Asn Gly Met Phe Ser Glu Ala Val Glu Leu Glu Arg Ala Asn 
210 215 220 



Gly Lys Lys Tyr Tyr Val Thr Ala Val Asp Gin Val Lys Pro Lys He 
225 230 235 240 



Ala Leu Leu Cys Phe Val Asp Lys Asp Pro Lys 
245 250 



INFORMATION FOR SEQ ID NO: 106: 



• 



-197- 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106 

Gly Leu Asp Thr Val Ser Phe Ser Thr Cys Gly Ala Thr Tyr lie Thr 
15 10 15 



Tyr Val Asn Phe Leu Asn Glu Leu Arg Val Lys Leu Lys Pro Glu Gly 
20 25 30 



Asn Ser His Gly lie Pro Leu Leu Arg Lys Lys Cys Asp Asp Pro Gly 
35 40 45 



Lys Cys Phe Val Leu Val Ala Leu Ser Asn Asp Asn Gly Gin Leu Ala 
50 55 60 



Glu He Ala He Asp Val Thr Ser Val Tyr Val Val Gly Tyr Gin Val 
65 70 75 30 



Arg Asn Arg Ser Tyr Phe Phe Lys Asp Ala Pro Asp Ala Ala Tyr Glu 
85 90 95 



Gly Leu Phe Lys Asn Thr He Lys Thr Arg Leu His Phe Gly Gly Ser 
100 105 110 



Tyr Pro Ser Leu Glu Gly Glu Lys Ala Tyr Arg Glu Thr Thr Asp Leu 
115 120 125 



Gly He Glu Pro Leu Arg He Gly He Lys Lys Leu Asp Glu Asn Ala 
130 135 140 



He Asp Asn Tyr Lys Pro Thr Glu He Ala Ser Ser Leu Leu Val Val 
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145 150 155 160 



He Gin Met Val Ser Glu Ala Ala Arg Phe Thr Phe lie Glu Asn Gin 
165 170 175 



He Arg Asn Asn Phe Gin Gin Arg He Arg Pro Ala Asn Asn Thr He 
180 185 190 



Ser Leu Glu Asn Lys Trp Gly Lys Leu Ser Phe Gin He Arg Thr Ser 
195 200 205 



Gly Ala Asn Gly Met Phe Ser Glu Ala Val Glu Leu Glu Arg Ala Asn 
210 215 220 



Gly Lys Lys Tyr Tyr Val Thr Ala Val Asp Gin Val Lys Pro Lys He 
225 230 235 240 



Ala Leu Leu Lys Phe Val Asp Lys Asp Pro Lys 
245 250 

(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

Gly Leu Asp Thr Val Ser Phe Ser Thr Lys Gly Ala Thr Tyr He Thr 
15 10 15 



Tyr Val Asn Phe Leu Asn Glu Leu Arg Val Lys Leu Lys Pro Glu Gly 
20 25 30 



Asn Ser His Gly He Pro Leu Leu Arg Lys Lys Cys Asp Asp Pro Gly 
35 40 45 
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Lys Cys Phe Val Leu Val Ala Leu Ser Asn Asp Cys Gly Gin Leu Ala 
50 55 60 



Glu lie Ala lie Asp Val Thr Ser Val Tyr Val Val Gly Tyr Gin Val 
65 70 75 80 



Arg Asn Arg Ser Tyr Phe Phe Lys Asp Ala Pro Asp Ala Ala Tyr Glu 
85 90 95 



Gly Leu Phe Lys Asn Thr lie Lys Thr Arg Leu His Phe Gly Gly Ser 
100 105 110 



Tyr Pro Ser Leu Glu Gly Glu Lys Ala Tyr Arg Glu Thr Thr Asp Leu 
115 120 125 



Gly lie Glu Pro Leu Arg lie Gly lie Lys Lys Leu Asp Glu Asn Ala 
130 135 140 



lie Asp Asn Tyr Lys Pro Thr Glu lie Ala Ser Ser Leu Leu Val Val 
145 150 155 160 



lie Gin Met Val Ser Glu Ala Ala Arg Phe Thr Phe lie Glu Asn Gin 
165 170 175 



lie Arg Asn Asn Phe Gin Gin Arg lie Arg Pro Ala Asn Asn Thr lie 
180 185 190 



Ser Leu Glu Asn Lys Trp Gly Lys Leu Ser Phe Gin lie Arg Thr Ser 
195 200 205 



Gly Ala Asn Gly Met Phe Ser Glu Ala Val Glu Leu Glu Arg Ala Asn 
210 215 220 



# 
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Gly Lys Lys Tyr Tyr Val Thr Ala Val Asp Gin Val Lys Pro Lys He 
225 230 235 240 



Ala Leu Leu Lys Phe Val Asp Lys Asp Pro Lys 
245 250 

(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:108: 

Gly Leu Asp Thr Val Ser Phe Ser Thr Lys Gly Ala Thr Tyr He Thr 
1 5 10 15 



Tyr Val Asn Phe Leu Asn Glu Leu Arg Val Lys Leu Lys Pro Glu Gly 
20 25 30 



Asn Ser His Gly He Pro Leu Leu Arg Lys Lys Cys Asp Asp Pro Gly 
35 40 45 



Lys Cys Phe Val Leu Val Ala Leu Ser Asn Asp Asn Gly Gin Leu Ala 
50 55 60 



Glu He Ala He Asp Val Thr Ser Val Tyr Val Val Gly Tyr Gin Val 
65 70 75 80 



Arg Asn Arg Ser Tyr Phe Phe Lys Asp Ala Pro Asp Ala Ala Tyr Glu 
85 90 95 



Gly Leu Phe Lys Asn Thr Cys Lys Thr Arg Leu His Phe Gly Gly Ser 
100 105 110 
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Tyr Pro Ser Leu Glu Gly Glu Lys Ala Tyr Arg Glu Thr Thr Asp Leu 
115 120 125 



Gly lie Glu Pro Leu Arg lie Gly lie Lys Lys Leu Asp Glu Asn Ala 
130 135 140 



lie Asp Asn Tyr Lys Pro Thr Glu lie Ala Ser Ser Leu Leu Val Val 
145 150 155 160 



lie Gin Met Val Ser Glu Ala Ala Arg Phe Thr Phe He Glu Asn Gin 
165 170 175 



He Arg Asn Asn Phe Gin Gin Arg He Arg Pro Ala Asn Asn Thr He 
180 185 190 



Ser Leu Glu Asn Lys Trp Gly Lys Leu Ser Phe Gin He Arg Thr Ser 
195 200 205 



Gly Ala Asn Gly Met Phe Ser Glu Ala Val Glu Leu Glu Arg Ala Asn 
210 215 220 



Gly Lys Lys Tyr Tyr Val Thr Ala Val Asp Gin Val Lys Pro Lys He 
225 230 235 240 



Ala Leu Leu Lys Phe Val Asp Lys Asp Pro Lys 
245 250 

(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

Gly Leu Asp Thr Val Ser Phe Ser Thr Lys Gly Ala Thr Tyr He Thr 



# 
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10 is 



Tyr Val Asn Phe Leu Asn Glu Leu Arg Val Lys Leu Lys Pro Glu Gly 
20 25 30 



Asn Ser His Gly lie Pro Leu Leu Arg Lys Lys Cys Asp Asp Pro Gly 
35 40 45 



Lys Cys Phe Val Leu Val Ala Leu Ser Asn Asp Asn Gly Gin Leu Ala 
50 55 60 



Glu lie Ala lie Asp Val Thr Ser Val Tyr Val Val Gly Tyr Gin Val 
65 70 75 80 



Arg Asn Arg Ser Tyr Phe Phe Lys Asp Ala Pro Asp Ala Ala Tyr Glu 
85 90 95 



Gly Leu Phe Lys Asn Thr lie Lys Thr Arg Leu His Phe Gly Gly Ser 
100 105 110 



Tyr Pro Ser Leu Glu Gly Glu Lys Ala Tyr Arg Glu Thr Thr Asp Leu 
115 120 125 



Gly He Glu Pro Leu Arg He Gly He Lys Lys Leu Asp Glu Asn Ala 
130 135 140 



He Asp Asn Tyr Lys Pro Thr Glu He Ala Ser Ser Leu Leu Val Val 
145 150 155 160 



He Gin Met Val Ser Glu Ala Ala Arg Phe Thr Phe He Glu Asn Gin 
165 170 175 



He Arg Asn Asn Phe Gin Gin Cys He Arg Pro Ala Asn Asn Thr He 
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180 185 190 



Ser Leu Glu Asn Lys Trp Gly Lys Leu Ser Phe Gin He Arg Thr Ser 
195 200 205 



Gly Ala Asn Gly Met Phe Ser Glu Ala Val Glu Leu Glu Arg Ala Asn 
210 215 220 



Gly Lys Lys Tyr Tyr Val Thr Ala Val Asp Gin Val Lys Pro Lys He 
225 230 235 240 



Ala Leu Leu Lys Phe Val Asp Lys Asp Pro Lys 
245 250 

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 110: 

Gly Leu Asp Thr Val Ser Phe Ser Thr Cys Gly Ala Thr Tyr He Thr 
15 10 15 



Tyr Val Asn Phe Leu Asn Glu Leu Arg Val Lys Leu Lys Pro Glu Gly 
20 25 30 



Asn Ser His Gly He Pro Leu Leu Arg Lys Lys Ala Asp Asp Pro Gly 
35 40 45 



Lys Ala Phe Val Leu Val Ala Leu Ser Asn Asp Asn Gly Gin Leu Ala 
50 55 60 



Glu He Ala He Asp Val Thr Ser Val Tyr Val Val Gly Tyr Gin Val 
65 70 75 80 
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Arg Asn Arg Ser Tyr Phe Phe Lys Asp Ala Pro Asp Ala Ala Tyr Glu 
85 90 95 



Gly Leu Phe Lys Asn Thr lie Lys Thr Arg Leu His Phe Gly Gly Ser 
100 105 110 



Tyr Pro Ser Leu Glu Gly Glu Lys Ala Tyr Arg Glu Thr Thr Asp Leu 
115 120 125 



Gly lie Glu Pro Leu Arg lie Gly lie Lys Lys Leu Asp Glu Asn Ala 
130 135 140 



lie Asp Asn Tyr Lys Pro Thr Glu lie Ala Ser Ser Leu Leu Val Val 
145 150 155 160 



lie Gin Met Val Ser Glu Ala Ala Arg Phe Thr Phe lie Glu Asn Gin 
165 170 175 



lie Arg Asn Asn Phe Gin Gin Arg lie Arg Pro Ala Asn Asn Thr He 
180 185 190 



Ser Leu Glu Asn Lys Trp Gly Lys Leu Ser Phe Gin He Arg Thr Ser 
195 200 205 



Gly Ala Asn Gly Met Phe Ser Glu Ala Val Glu Leu Glu Arg Ala Asn 
210 215 220 



Gly Lys Lys Tyr Tyr Val Thr Ala Val Asp Gin Val Lys Pro Lys He 
225 230 235 240 



Ala Leu Leu Lys Phe Val Asp Lys Asp Pro Lys 
245 250 
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(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

Gly Leu Asp Thr Val Ser Phe Ser Thr Cys Gly Ala Thr Tyr He Thr 
15 10 is 



Tyr Val Asn Phe Leu Asn Glu Leu Arg Val Lys Leu Lys Pro Glu Gly 
20 25 30 



Asn Ser His Gly He Pro Leu Leu Arg Lys Lys Ala Asp Asp Pro Gly 
35 40 45 



Lys Ala Phe Val Leu Val Ala Leu Ser Asn Asp Asn Gly Gin Leu Ala 
50 55 60 



Glu He Ala He Asp Val Thr Ser Val Tyr Val Val Gly Tyr Gin Val 
65 70 75 80 



Arg Asn Arg Ser Tyr Phe Phe Lys Asp Ala Pro Asp Ala Ala Tyr Glu 
85 90 95 



Gly Leu Phe Lys Asn Thr He Lys Thr Arg Leu His Phe Gly Gly Ser 
100 105 110 



Tyr Pro Ser Leu Glu Gly Glu Lys Ala Tyr Arg Glu Thr Thr Asp Leu 
115 120 125 



Gly He Glu Pro Leu Arg He Gly He Lys Lys Leu Asp Glu Asn Ala 
130 135 140 



# 
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Ile Asp Asn Tyr Lys Pro Thr Glu lie Ala Ser Ser Leu Leu Val Val 
145 150 155 160 



lie Gin Met Val Ser Glu Ala Ala Arg Phe Thr Phe lie Glu Asn Gin 
165 170 175 



He Arg Asn Asn Phe Gin Gin Arg lie Arg Pro Ala Asn Asn Thr lie 
180 185 190 



Ser Leu Glu Asn Lys Trp Gly Lys Leu Ser Phe Gin lie Arg Thr Ser 
195 200 205 



Gly Ala Asn Gly Met Phe Ser Glu Ala Val Glu Leu Glu Arg Ala Asn 
210 215 220 



Gly Lys Lys Tyr Tyr Val Thr Ala Val Asp Gin Val Lys Pro Lys lie 
225 230 235 240 



Ala Leu Leu Lys Phe Val Cys Lys Asp Pro Lys 
245 250 

(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 
TGATGCGGCC GACATCTCAA GCTTGGTGC 29 
(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113 
TGATGCGGCC GACATCTCAA GCTTGGTGC 



(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114 
TCTAGGTCAC CGTCTCCTCA CCATCTGGAC AGGCTGGA 



(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
TTCGAAGCTT GAGATGAAAC CATCTGGACA GGCTGGA 



(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

AGTCGTCGAC ACGATGGACA TGAGGAC 

(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 98 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO:117: 



AGTCGTCGAC ACGATGGACA TGAGGACCCC TGCTCAGTTT CTTGGCATCC TCCTACTCTG 
GTTTCCAGGT ATCAAATGTG ACATCCAGAT GACTCAGT 
(2) INFORMATION FOR SEQ ID NO: 118: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:118: 



TCACTTGCCG GGCGAATCAG GACATTAATA GCTATTTAAG CTGGTTCCAG CAGAAACCAG 
GGAAAGCTCC TAAGACCCT 



(2) INFORMATION FOR SEQ ID NO: 119: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 



TGACTCGCCC GGCAAGTGAT AGTGACTCTG TCTCCTACAG ATGCAGACAG GGAAGATGGA 60 
GACTGAGTCA TCTGGATGTC 80 



(2) INFORMATION FOR SEQ ID NO: 120: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:120: 
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GATCCACTGC CACTGAACCT TGATGGGACC CCAGATTCCA ATCTGTTTGC ACGATAGATC 60 



(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

GGTTCAGTGG CAGTGGATCT GGGACAGATT ATACTCTCAC CATCAGCAGC CTGCAATATG 60 

AAGATTTTGG AATTTATTAT TG 82 

(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 82 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 
GTTTGATTTC AAGCTTGGTG CCTCCACCGA ACGTCCACGG AGACTCATCA TACTGTTGAC 60 
AATAATAAAT TCCAAAATCT TC 82 
(2) INFORMATION FOR SEQ ID NO: 123: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 

Asp He Lys Met Thr Gin Ser Pro Ser Ser Met Tyr Ala Ser Leu Gly 



AGGGTCTTAG GAGCTTTCC 



79 



1 



5 



10 



IS 



Glu Arg Val Thr He Thr Cys Lys Ala Ser Gin Asp He Asn Ser Tyr 
20 25 30 



-210- 



Leu Ser Trp Phe His His Lys Pro Gly Lys Ser Pro Lys Thr Leu lie 
35 40 45 



Tyr Arg Ala Asn Arg Leu Val Asp Gly Val Pro Ser Arg Phe Ser Gly 
50 55 60 



Ser Gly Ser Gly Gin Asp Tyr Ser Leu Thr lie Ser Ser Leu Asp Tyr 
65 70 75 80 



Glu Asp Met Gly He Tyr Tyr Cys Gin Gin Tyr Asp Glu Ser Pro Trp 
85 90 95 



Thr Phe Gly Gly Gly Thr Lys Leu Glu He Lys 
100 105 

(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

Gin He Gin Leu Val Gin Ser Gly Pro Glu Leu Lys Lys Pro Gly Glu 
15 10 15 



Thr Val Lys He Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Asn Tyr 
20 25 30 



Gly Met Asn Trp Val Lys Gin Ala Pro Gly Lys Gly Leu Arg Trp Met 
35 40 45 



Gly Trp He Asn Thr His Thr Gly Glu Pro Thr Tyr Ala Asp Asp Phe 
50 55 SO 



# 
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Lys Gly Arg Phe Ala Phe Ser Leu Glu Thr Ser Ala Ser Thr Ala Tyr 
65 70 75 80 



Leu Gin lie Asn Asn Leu Lys Asn Glu Asp Thr Ala Thr Tyr Phe Cys 
85 90 95 



Thr Arg Arg Gly Tyr Asp Trp Tyr Phe Asp Val Trp Gly Ala Gly Thr 
100 105 no 



Thr Val Thr Val Ser Ser 
115 

(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 

Asp lie Gin Met Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val Gly 
15 10 15 



Asp Arg Val Thr lie Thr Cys Arg Ala Ser Gin Asp lie Asn Ser Tyr 

20 25 30 



Leu Ser Trp Phe Gin Gin Lys Pro Gly Lys Ala Pro Lys Thr Leu lie 
35 40 45 



Tyr Arg Ala Asn Arg Leu Glu Ser Gly Val Pro Ser Arg Phe Ser Gly 
50 55 60 



Ser Gly Ser Gly Thr Asp Tyr Thr Leu Thr lie Ser Ser Leu Gin Tyr 
65 70 75 80 



# 
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Glu Asp Phe Gly He Tyr Tyr Cys Gin Gin Tyr Asp Glu Ser Pro Trp 
85 90 95 



Thr Phe Gly Gly Gly Thr Lys Leu Glu lie Lys 
100 105 

(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 

Glu lie Gin Leu Val Gin Ser Gly Gly Gly Leu Val Lys Pro Gly Gly 
15 10 15 



Ser Val Arg lie Ser Cys Ala Ala Ser Gly Tyr Thr Phe Thr Asn Tyr 
20 25 30 



Gly Met Asn Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Met 
35 40 45 



Gly Trp lie Asn Thr His Thr Gly Glu Pro Thr Tyr Ala Asp Ser Phe 
50 55 60 



Lys Gly Arg Phe Thr Phe Ser Leu Asp Asp Ser Lys Asn Thr Ala Tyr 
65 70 75 80 



Leu Gin lie Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Phe Cys 
85 90 95 



Thr Arg Arg Gly Tyr Asp Trp Tyr Phe Asp Val Trp Gly Gin Gly Thr 
100 105 110 



# 
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Thr Val Thr val Ser Ser 
115 

(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 280 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:127: 

Ala Ala Lys Met Ala Lys Asn Val Asp Lys Pro Leu Phe Thr Ala Thr 
1 5 10 is 



Phe Asn Val Gin Ala Ser Ser Ala Asp Tyr Ala Thr Phe He Ala Gly 
20 25 30 



He Arg Asn Lys Leu Arg Asn Pro Ala His Phe Ser His Asn Arg Pro 
35 40 45 



Val Leu Pro Pro Val Glu Pro Asn Val Pro Pro Ser Arg Trp Phe His 
50 55 60 



Val Val Leu Lys Ala Ser Pro Thr Ser Ala Gly Leu Thr Leu Ala He 
65 70 75 80 



Arg Ala Asp Asn He Tyr Leu Glu Gly Phe Lys Ser Ser Asp Gly Thr 
85 90 95 



Trp Trp Glu Leu Thr Pro Gly Leu He Pro Gly Ala Thr Tyr Val Gly 
100 105 no 



Phe Gly Gly Thr Tyr Arg Asp Leu Leu Gly Asp Thr Asp Lys Leu Thr 
115 120 125 



Asn Val Ala Leu Gly Arg Gin Gin Leu Ala Asp Ala Val Thr Ala Leu 
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130 135 140 



His Gly Arg Thr Lys Ala Asp Lys Ala Ser Gly Pro Lys Gin Gin Gin 
145 150 155 160 



Ala Arg Glu Ala Val Thr Thr Leu Val Leu Met Val Asn Glu Ala Thr 
165 170 175 



Arg Phe Gin Thr Val Ser Gly Phe Val Ala Gly Leu Leu His Pro Lys 
180 185 190 



Ala Val Glu Lys Lys Ser Gly Lys He Gly Asn Glu Met Lys Ala Gin 
195 200 205 



Val Asn Gly Trp Gin Asp Leu Ser Ala Ala Leu Leu Lys Thr Asp Val 
210 215 220 



Lys Pro Pro Pro Gly Lys Ser Pro Ala Lys Phe Ala Pro He Glu Lys 
225 230 235 240 



Met Gly Val Arg Thr Ala Glu Gin Ala Ala Asn Thr Leu Gly He Leu 
245 250 255 



Leu Phe Val Glu Val Pro Gly Gly Leu Thr Val Ala Lys Ala Leu Glu 
260 265 270 



Leu Phe His Ala Cys Gly Gly Lys 
275 280 

(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 0 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 

Ala Ala Lys Met Ala Lys Asn Val Asp Lys Pro Leu Phe Thr Ala Thr 
1 5 10 is 



Phe Asn Val Gin Ala Ser Ser Ala Asp Tyr Ala Thr Phe He Ala Gly 
20 25 30 



He Arg Asn Lys Leu Arg Asn Pro Ala His Phe Ser His Asn Arg Pro 
35 40 45 



Val Leu Pro Pro Val Glu Pro Asn Val Pro Pro Ser Arg Trp Phe His 
50 55 60 



Val Val Leu Lys Ala Ser Pro Thr Ser Ala Gly Leu Thr Leu Ala He 
65 70 75 80 



Arg Ala Asp Asn He Tyr Leu Glu Gly Phe Lys Ser Ser Asp Gly Thr 
85 90 95 



Trp Trp Glu Leu Thr Pro Gly Leu He Pro Gly Ala Thr Tyr Val Gly 
100 105 no 



Phe Gly Gly Thr Tyr Arg Asp Leu Leu Gly Asp Thr Asp Lys Leu Thr 
115 120 125 



Asn Val Ala Leu Gly Arg Gin Gin Leu Ala Asp Ala Val Thr Ala Leu 
130 135 140 



His Gly Arg Thr Lys Ala Asp Lys Ala Ser Gly Pro Lys Gin Gin Gin 
145 150 155 160 



Ala Arg Glu Ala Val Thr Thr Leu Val Leu Met Val Asn Glu Ala Thr 
165 170 175 
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Arg Phe Gin Thr Val Ser Gly Phe Val Ala Gly Leu Leu His Pro Lys 
180 185 190 



Ala Val Glu Lys Lys Ser Gly Lys lie Gly Asn Glu Met Lys Ala Gin 
195 200 205 



Val Asn Gly Trp Gin Asp Leu Ser Ala Ala Leu Leu Lys Thr Asp Val 
210 215 220 



Lys Pro Pro Pro Gly Lys Ser Pro Ala Lys Phe Ala Pro lie Glu Lys 
225 230 235 240 



Met Gly Val Arg Thr Ala Glu Gin Ala Ala Asn Thr Leu Gly He Leu 
245 250 255 



Leu Phe Val Glu Val Pro Gly Gly Leu Thr Val Ala Lys Cys Leu Glu 
260 265 270 



Leu Phe His Ala Ser Gly Gly Lys 
275 280 

(2) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 280 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO:129: 

Ala Ala Lys Met Ala Lys Asn Val Asp Lys Pro Leu Phe Thr Ala Thr 
15 10 15 



Phe Asn Val Gin Ala Ser Ser Ala Asp Tyr Ala Thr Phe He Ala Gly 
20 25 30 
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Ile Arg Asn Lys Leu Arg Asn Pro Ala His Phe Ser His Asn Arg Pro 
35 40 45 



Val Leu Pro Pro Val Glu Pro Asn Val Pro Pro Ser Arg Trp Phe His 
50 55 60 



Val Val Leu Lys Ala Ser Pro Thr Ser Ala Gly Leu Thr Leu Ala He 
65 70 75 80 



Arg Ala Asp Asn He Tyr Leu Glu Gly Phe Lys Ser Ser Asp Gly Thr 
85 90 95 



Trp Trp Glu Leu Thr Pro Gly Leu He Pro Gly Ala Thr Tyr Val Gly 
100 105 no 



Phe Gly Gly Thr Tyr Arg Asp Leu Leu Gly Asp Thr Asp Lys Leu Thr 
115 120 125 



Asn Val Ala Leu Gly Arg Gin Gin Leu Ala Asp Ala Val Thr Ala Leu 
130 135 140 



His Gly Arg Thr Lys Ala Asp Lys Ala Ser Gly Pro Lys Gin Gin Gin 
145 150 155 160 



Ala Arg Glu Ala Val Thr Thr Leu Val Leu Met Val Asn Glu Ala Thr 
165 170 175 



Arg Phe Gin Thr Val Ser Gly Phe Val Ala Gly Leu Leu His Pro Lys 
180 185 190 



Ala Val Glu Lys Lys Ser Gly Lys He Gly Asn Glu Met Lys Ala Gin 
195 200 205 
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Val Asn Gly Trp Gin Asp Leu Ser Ala Ala Leu Leu Lys Thr Asp Val 
210 215 220 



Lys Pro Pro Pro Gly Lys Ser Pro Ala Lys Phe Ala Pro He Glu Lys 
225 230 235 240 



Met Gly Val Arg Thr Ala Glu Gin Ala Ala Asn Thr Leu Gly He Cys 
245 250 255 



Leu Phe Val Glu Val Pro Gly Gly Leu Thr Val Ala Lys Ala Leu Glu 
260 265 270 



Leu Phe His Ala Ser Gly Gly Lys 
275 280 

(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

Ser Cys Asp Lys Thr His Thr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO:131: 
TGTCGACATC ATGGCTTGGG TGTGGACCTT GCTATTCCTG ATGGCAGCTG CCCAAAGTGC 60 
CCAAGCAGAG ATCCAGTTGG TGCAG 85 



(2) INFORMATION FOR SEQ ID NO: 132: 



-219- 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 86 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 
AAGGTATACC CAGAAGCTGC GCAGGAGATT CTGACGGACC CTCCAGGCTT CACCAGGCCT 60 
CCTCCAGACT GCACCAACTG GATCTC 86 
(2) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 
GCAGCTTCTG GGTATACCTT CACAAACTAT GGAATGAACT GGGTGCGCCA GGCTCCAGGA 60 
AAGAATTTAG AGTGGATGGG CTGG 34 
(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 
AAAGAGAAGG TAAACCGTCC CTTGAAAGAA TCAGCATATG TTGGCTCTCC AGTGTGGGTG 60 
TTTATCCAGC CCATCCACTC TAAAC 85 
(2) INFORMATION FOR SEQ ID NO: 135: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



# 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 
GACGGTTTAC CTTCTCTTTG GACGATTCTA AGAACACTGC CTATTTACAG ATCAACAGCC 60 
TCAGAGCCGA GGACACGGCT GTGTATT 87 
(2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:136: 
GAGGAGACGG TGACCGTGGT CCCTTGGCCC CAGACATCGA AGTACCAGTC GTAACCCCGT 60 
CTTGTACAGA AATACACAGC CGTGTCCTCG GC 92 
(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:137: 
GCAGCTTCTG GGTATACCTT CACAAACTAT GGAATGAACT GGGTGAAGCA GGCTCCAGGA 60 
AAGGGTTTAA GGTGGATGGG CTGG 84 
(2) INFORMATION FOR SEQ ID NO: 13 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 
AAAGAGAAGG TAAACCGTCC CTTGAAGTCA TCAGCATATG TTGGCTCTCC AGTGTGGGTG 60 
TTTATCCAGC CCATCCACCT TAAAC 85 



(2) INFORMATION FOR SEQ ID NO: 139: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 
GACGGTTTAC CTTCTCTTTG GACACGTCTA AGTGCACTGC CTATTTACAG ATCAACAGCC 6 0 
TCAGAGCCGA GGACACGGCT ACAT 84 
(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 
AGGAGACGGT GACCGTGGTC CCTTGGCCCC AGACATCGAA GTACCAGTCG TAACCCCGTC €0 
TTGTACAGAA ATATGTAGCC GTGTCCTCGG C 91 
(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 

Lys Pro Ala Lys Phe Phe Arg Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 



Lys Pro Ala Lys Phe Leu Arg Leu 

1 5 
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(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 
GGCCGCAAAG CCGGCTAAGT TCTTMCGTCT GAGT 
(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 
ACTCAGACGK AAGAACTTAG CCGGCTTTGC 
(2) INFORMATION FOR SEQ ID NO: 145: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 
AAGGTATACC CAGAAGCTGC GCAGGAGATT CTGACGGACC CTCCAGGCTT CTTCAGGCCA 
GGTCCAGACT GCACCAACTG GATCT 



(2) INFORMATION FOR SEQ ID NO: 146 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



-223- 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 
ACTAGTGTCG ACATCATGGC TTGGGT 
(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 240 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

Asp lie Gin Met Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val Gly 
1 5 10 15 

Asp Arg Val Thr He Thr Cys Arg Ala Ser Gin Asp He Asn Ser Tyr 
20 25 30 

Leu Ser Trp Phe Gin Gin Lys Pro Gly Lys Ala Pro Lys Thr Leu He 
35 40 45 

Tyr Arg Ala Asn Arg Leu Glu Ser Gly Val Pro Ser Arg Phe Ser Gly 
50 55 60 

Ser Gly Ser Gly Thr Asp Tyr Thr Leu Thr He Ser Ser Leu Gin Tyr 
65 70 75 80 

Glu Asp Phe Gly He Tyr Tyr Cys Gin Gin Tyr Asp Glu Ser Pro Trp 
85 90 95 

Thr Phe Gly Gly Gly Thr Lys Leu Glu Met Lys Gly Gly Gly Gly Ser 
100 105 110 

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Glu He Gin Leu Val Gin 
115 120 125 

Ser Gly Gly Gly Leu Val Lys Pro Gly Gly Ser Val Arg He Ser Cys 
130 135 140 

Ala Ala Ser Gly Tyr Thr Phe Thr Asn Tyr Gly Met Asn Trp Val Arg 
145 150 155 160 

Gin Ala Pro Gly Lys Gly Leu Glu Trp Met Gly Trp He Asn Thr His 
165 170 175 

Thr Gly Glu Pro Thr Tyr Ala Asp Ser Phe Lys Gly Arg Phe Thr Phe 
180 185 190 

Ser Leu Asp Asp Ser Lys Asn Thr Ala Tyr Leu Gin He Asn Ser Leu 
195 200 205 



26 
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Arg Ala Glu Asp Thr Ala Val Tyr Phe Cys Thr Arg Arg Gly Tyr Asp 
210 215 220 

Trp Tyr Phe Asp Val Trp Gly Gin Gly Thr Thr Val Thr Val Ser Ser 
225 230 235 240 

(2) INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 0 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 

Glu lie Gin Leu Val Gin Ser Gly Gly Gly Leu Val Lys Pro Gly Gly 
15 10 15 

Ser Val Arg lie Ser Cys Ala Ala Ser Gly Tyr Thr Phe Thr Asn Tyr 
20 25 30 

Gly Met Asn Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Met 
35 40 45 

Gly Trp lie Asn Thr His Thr Gly Glu Pro Thr Tyr Ala Asp Ser Phe 
50 55 60 

Lys Gly Arg Phe Thr Phe Ser Leu Asp Asp Ser Lys Asn Thr Ala Tyr 
65 70 75 80 

Leu Gin lie Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Phe Cys 
35 90 95 

Thr Arg Arg Gly Tyr Asp Trp Tyr Phe Asp Val Trp Gly Gin Gly Thr 
100 105 110 

Thr Val Thr Val Ser Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
115 120 125 

Gly Gly Gly Gly Ser Asp lie Gin Met Thr Gin Ser Pro Ser Ser Leu 
130 135 140 

Ser Ala Ser Val Gly Asp Arg Val Thr lie Thr Cys Arg Ala Ser Gin 
145 150 155 160 

Asp lie Asn Ser Tyr Leu Ser Trp Phe Gin Gin Lys Pro Gly Lys Ala 
165 170 175 

Pro Lys Thr Leu lie Tyr Arg Ala Asn Arg Leu Glu Ser Gly Val Pro 
180 185 190 

Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Tyr Thr Leu Thr He 
195 200 205 
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Ser Ser Leu Gin Tyr Glu Asp Phe Gly lie Tyr Tyr Cys Gin Gin Tyr 
210 215 220 

Asp Glu Ser Pro Trp Thr Phe Gly Gly Gly Thr Lys Leu Glu Met Lys 
225 230 235 240 



(2) INFORMATION FOR SEQ ID NO: 14 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 

Asp lie Gin Met Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val Gly 
15 10 15 

Asp Arg Val Thr lie Thr Cys Arg Ala Ser Gin Xaa lie Ser Xaa Tyr 
20 25 30 

Leu Xaa Trp Tyr Gin Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu lie 
35 40 45 

Tyr Ala Ala Ser Xaa Leu Xaa Ser Gly Val Pro Ser Arg Phe Ser Gly 
50 55 60 

Ser Gly Ser Gly Thr Xaa Phe Thr Leu Thr He Ser Ser Leu Gin Pro 
65 70 75 80 

Glu Asp Phe Ala Thr Tyr Tyr Cys Gin Gin Tyr Xaa Xaa Xaa Pro Xaa 
85 90 95 

Thr Phe Gly Gin Gly Thr Lys Val Glu He Lys 
100 105 

(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 

Glu He Val Leu Thr Gin Ser Pro Gly Thr Leu Ser Leu Ser Pro Gly 
15 10 15 

Glu Arg Ala Thr Leu Ser Cys Arg Ala Ser Gin Ser Val Ser Ser Ser 
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20 25 30 

Tyr Leu Ala Trp Tyr Gin Gin Lys Pro Gly Gin Ala Pro Arg Leu Leu 
35 40 45 

He Tyr Gly Ala Ser Ser Arg Ala Thr Gly He Pro Asp Arg Phe Ser 
50 55 60 

Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr He Ser Arg Leu Glu 
65 70 75 80 

Pro Gly Asp Phe Ala Val Tyr Tyr Cys Gin Gin Tyr Gly Ser Ser Pro 
85 90 95 

Xaa Thr Phe Gly Gin Gly Thr Lys Val Glu He Lys 
100 105 

INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 

Asp He Val Met Thr Gin Ser Pro Leu Ser Leu Pro Val Thr Pro Gly 
15 10 15 

Glu Pro Ala Ser He Ser Cys Arg Ser Ser Gin Ser Leu Leu Asn Asn 
20 25 30 

Tyr Leu Asn Trp Tyr Leu Gin Lys Pro Gly Gin Ser Pro Gin Leu Leu 
35 40 45 

He Tyr Leu Gly Ser Asn Arg Ala Ser Gly Val Pro Asp Arg Phe Ser 
50 55 60 

Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Lys He Ser Arg Val Glu 
65 70 75 30 

Ala Glu Asp Val Gly Val Tyr Tyr Cys Met Gin Ala Leu Gin Xaa Pro 
85 90 95 

Xaa Thr Phe Gly Gin Gly Thr Lys Xaa Glu He Lys 
100 105 

INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 

Xaa Ser Val Leu Thr Gin Pro Pro Ser Ala Ser Gly Thr Pro Gly Gin 
15 10 15 

Arg Val Thr He Ser Cys Ser Gly Ser Ser Ser He Gly Xaa Asn Xaa 
20 25 30 

Val Xaa Trp Tyr Gin Gin Leu Pro Gly Thr Ala Pro Lys Leu Leu He 
35 40 45 

Tyr Asn Asn Arg Pro Ser Gly Val Pro Asp Arg Phe Ser Gly Ser Lys 
50 55 60 

Ser Gly Thr Ser Ala Ser Leu Ala He Ser Gly Leu Gin Ser Glu Asp 
65 70 75 80 

Glu Ala Asp Tyr Tyr Cys Ala Thr Trp Asp Asp Ser Leu Asp Pro Val 
85 90 95 

Phe Gly Gly Gly Thr Lys Thr Val Leu Gly 
100 105 

(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

Cii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 

Xaa Ser Ala Leu Thr Gin Pro Ala Ser Val Ser Gly Ser Pro Gly Gin 
15 10 15 

Ser He Thr He Ser Cys Thr Gly Thr Ser Ser Val Gly Tyr Asn Xaa 
20 25 30 

Val Ser Trp Tyr Gin Gin His Pro Gly Lys Ala Pro Lys Leu He Tyr 
35 40 45 

Asp Val Arg Pro Ser Gly Val Arg Phe Ser Gly Ser Lys Ser Gly Asn 
50 55 60 

Thr Ala Ser Leu Thr He Ser Gly Leu Gin Ala Glu Asp Glu Ala Asp 
65 70 75 80 

Tyr Tyr Cys Ser Ser Tyr Xaa Gly Xaa Xaa Xaa Xaa Val Phe Gly Gly 
85 90 95 

Gly Thr Lys Leu Thr Val Leu Gly 
100 

(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

Ser Tyr Glu Leu Thr Gin Pro Pro Ser Vai Ser Val Ser Pro Giy Gin 

1 5 10 15 

Thr Ala lie Thr Cys Ser Gly Asp Xaa Leu Xaa Xaa Xaa Tyr Val Xaa 
20 25 30 

Trp Tyr Gin Gin Lys Pro Gly Gin Ala Pro Val Leu Val He Tyr Asd 
35 40 45 

Arg Pro Ser Gly He Pro Gin Arg Phe Ser Gly Ser Ser Thr Thr Ala 

50 55 60 

Thr Leu Thr He Ser Gly Val Gin Ala Asp Glu Ala Asp Tyr Tyr Cys 
65 70 75 80 

Gin Xaa Trp Asp Xaa Xaa Xaa Val Val Phe Gly Gly Gly Thr Lys Leu 
35 90 95 

Thr Val Leu Gly 
100 

(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 

Asn Phe Met Leu Thr Gin Pro His Ser Val Ser Glu Ser Pro Gly Lys 
15 10 15 

Thr Val Thr He Ser Cys Thr Xaa Ser Xaa Gly He Ala Ser Xaa Tyr 
20 25 30 

Val Gin Trp Tyr Gin Gin Arg Pro Gly Ser Ala Pro Thr Thr Val lie 
35 40 45 

Tyr Glu Asp Asn Arg Pro Ser Gly Val Pro Asp Arg Phe Ser Gly Ser 
50 55 60 

Ser Ser Asn Ser Ala Ser Leu Thr He Ser Gly Leu Lys Thr Glu Asp 

65 70 75 80 

Glu Ala Asp Tyr Tyr Cys Gin Ser Tyr Asp Ser Xaa Xaa Trp Val Phe 
85 90 95 
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Gly Gly Gly Thr Lys Leu Thr Val Leu Gly 

100 105 

(2} INFORMATION FOR SEQ ID NO: 156: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 

Asp lie Val Met Thr Gin Ser Pro Asp Ser Leu Ala Val Ser Leu Gly 
15 10 15 

Glu Arg Ala Thr lie Asn Cys Lys Ser Ser Gin Ser Val Leu Lys Asn 
20 25 30 

Tyr Leu Ala Trp Tyr Gin Gin Lys Pro Gly Gin Pro Pro Lys Leu Leu 
35 40 45 

lie Tyr Trp Ala Ser Arg Glu Ser Gly Val Pro Asp Arg Phe Ser Gly 
50 55 60 

Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr lie Ser Ser Leu Gin Ala 

65 70 75 80 

Gin Asp Val Ala Val Tyr Tyr Cys Gin Gin Tyr Tyr Ser Thr Pro Xaa 
85 90 95 

Thr Phe Gly Gin Gly Thr Lys Xaa Gly lie Lys 
100 105 

(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: double 

( D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 

Ser Glu Leu Thr Gin Pro Pro Ser Val Ser Val Ala Pro Gly Gin Thr 
15 10 15 

Arg He Thr Cys Ser Gly Asp Xaa Leu Gly Xaa Tyr Asp Ala Xaa Trp 

20 25 30 

Tyr Gin Gin Lys Pro Gly Gin Ala Pro Leu Leu Val He Tyr Gly Arg 
35 40 45 

Asn Arg Pro Ser Gly He Pro Asp Arg Phe Ser Gly Ser Ser Ser Gly 
50 55 60 
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His Thr Ala Ser Leu Thr He Thr Gly Ala Gin Ala Glu Asp Glu Ala 

65 70 75 80 

Asp Tyr Tyr Cys Asn Ser Arg Asp Ser Ser Gly Lys Val Leu Phe Gly 
85 90 95 

Gly Gly Thr Lys Leu Thr Val Leu Gly 
100 105 

[2) INFORMATION FOR SEQ ID NO: 158: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 

Ser Ala Leu Thr Gin Pro Pro Ser Ala Ser Gly Ser Pro Gly Gin Ser 
15 10 15 

Val Thr He Ser Cys Thr Gly Thr Ser Ser Val Gly Xaa Xaa Tyr Val 
20 25 30 

Ser Trp Tyr Gin Gin His Gly Ala Pro Lys He Glu Val Arg Pro Ser 
35 40 45 

Gly Val Pro Asp Arg Phe Ser Gly Ser Lys Ser Asn Thr Ala Ser Leu 
50 55 60 

Thr Val Ser Gly Leu Ala Glu Asp Glu Ala Asp Tyr Tyr Cys Ser Ser 
65 70 75 80 

Tyr Xaa Xaa Xaa Xaa Xaa Phe Val Phe Gly Gly Thr Lys Thr Val Leu 
85 90 95 

(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 

Glu Val Gin Leu Val Glu Ser Gly Gly Gly Leu Val Gin Pro Gly Gly 
15 10 15 

Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Ser Xaa Xaa 
20 25 30 



-231- 

Xaa Met Xaa Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val 
35 40 45 

Xaa Xaa lie Xaa Xaa Xaa Xaa Xaa Gly Xaa Xaa Tyr Ala Asp Ser Val 
50 55 60 

Lys Gly Arg Phe Thr lie Ser Arg Asp Asp Ser Lys Asn Thr Leu Tyr 
65 70 75 80 

Leu Gin Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys 
85 90 95 

Ala Arg Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Trp Gly Gin Gly 
100 105 110 

Thr Leu Val Thr Val Ser Ser 
115 

INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 

Gin Val Gin Leu Val Gin Ser Gly Ala Glu Val Lys Lys Pro Gly Xaa 
15 10 15 

Ser Val Xaa Val Ser Cys Lys Xaa Ser Gly Tyr Tyr Phe Xaa Xaa Tyr 
20 25 30 

Xaa lie Xaa Trp Val Arg Gin Ala Pro Gly Xaa Gly Leu Glu Trp Val 
35 40 45 

Gly Xaa lie Xaa Pro Xaa Xaa Gly Xaa Thr Xaa Tyr Ala Pro Xaa Phe 
50 55 60 

Gin Gly Arg Val Thr Xaa Thr Arg Asp Xaa Ser Xaa Asn Thr Ala Tyr 
65 70 75 80 

Met Glu Leu Xaa Ser Leu Arg Ser Glu Asp Thr Ala Val Tyr Tyr Cys 
85 90 95 

Ala Arg Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Trp Gly Gin Gly 
100 105 110 

Thr Leu Val Thr Val Ser Ser 
115 

INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : double 
{ D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 

Xaa Val Thr Leu Xaa Glu Ser Gly Pro Xaa Leu Val Leu Pro Thr Gin 
15 10 15 

Thr Leu Thr Leu Thr Cys Thr Val Ser Gly Xaa Ser Leu Ser Xaa Xaa 
20 25 30 

Xaa Val Xaa Trp He Arg Gin Pro Pro Gly Lys Xaa Leu Glu Trp Leu 
35 40 45 

Ala Xaa He Xaa Xaa Asp Asp Asp Xaa Tyr Xaa Thr Ser Leu Arg Ser 
50 55 60 

Arg Leu Thr He Ser Lys Asp Thr Ser Lys Asn Gin Val Val Leu Xaa 

65 70 75 80 

Xaa Xaa Xaa Xaa Asp Pro Xaa Asp Thr Ala Thr Tyr Tyr Cys Ala Arg 
85 90 95 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asp Val Trp Gly Gin Gly Thr Thr 
100 105 110 

Val Thr Val Ser Ser 
115 

(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 

Asp He Gin Met Thr Gin Ser Pro Ser Thr Leu Ser Ala Ser Val Gly 
15 10 15 

Asp Arg Val Thr He Thr Cys Arg Ala Ser Gin Ser He Asn Thr Trp 
20 25 30 

Leu Ala Trp Tyr Gin Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu Met 
35 40 45 

Tyr Lys Ala Ser Ser Leu Glu Ser Gly Val Pro Ser Arg Phe He Gly 
50 55 60 

Ser Gly Ser Gly Thr Glu Phe Thr Leu Thr He Ser Ser Leu Gin Pro 
65 70 75 80 

Asp Asp Phe Ala Thr Tyr Tyr Cys Gin Gin Tyr Asn Ser Asp Ser Lys 
85 90 95 
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Met Phe Gly Gin Gly Thr Lys Val Glu Val Lys 
100 105 



(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 163 : 

Gin lie Val Leu Thr Gin Ser Pro Ala lie Met Ser Ala Ser Pro Gly 

15 10 15 

Glu Lys Val Thr lie Thr Cys Ser Ala Ser Ser Ser lie Ser Tyr Met 
20 25 30 

His Trp Phe Gin Gin Lys Pro Gly Thr Ser Pro Lys Leu Trp lie Tyr 
35 40 45 

Thr Thr Ser Asn Leu Ala Ser Gly Val Pro Ala Arg Phe Ser Gly Ser 
50 55 60 

Gly Ser Gly Thr Ser Tyr Ser Leu Thr lie Ser Arg Met Glu Ala Glu 

65 70 75 80 

Asp Ala Ala Thr Tyr Tyr Cys His Gin Arg Ser Thr Tyr Pro Leu Thr 
85 90 95 

Phe Gly Ser Gly Thr Lys Leu Glu Leu Lys 
100 105 

(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: double 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 

Asp lie Gin Leu Thr Gin Ser Pro Ser Ser Met Ser Ala Ser Pro Gly 

15 10 15 

Asp Arg Val Thr lie Thr Cys Arg Ala Ser Ser Ser lie Ser Tyr Met 
20 25 30 

His Trp Phe Gin Gin Lys Pro Gly Lys Ser Pro Lys Leu Trp He Tyr 
35 40 45 
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Thr 



Thr 

50 



Ser 



Asn 



Leu 



Ala Ser Gly Val Pro 
55 



Ser Arg Phe Ser Gly Ser 

60 



Gly Ser Gly Thr Ser Tyr Thr Leu Thr lie Ser Ser Met Gin Ala Glu 
65 70 75 80 

Asp Phe Ala Thr Tyr Tyr Cys His Gin Arg Ser Thr Tyr Pro Leu Thr 
85 90 95 

Phe Gly Gin Gly Thr Lys Leu Glu Leu Lys 



(2) INFORMATION FOR SEQ ID NO: 165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 

Asp lie Gin Met Thr Gin Ser Pro Ser Thr Leu Ser Ala Ser Val Glv 
1 5 10 15 

Asp Arg Val Thr He Thr Cys Ser Ala Ser Ser Ser He Ser Tyr Met 
20 25 30 

His Trp Tyr Gin Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu He Tyr 
35 40 45 

Thr Thr Ser Asn Leu Ala Ser Gly Val Pro Ala Arg Phe Ser Gly Ser 
50 55 60 

Gly Ser Gly Thr Glu Phe Thr Leu Thr He Ser Ser Leu Gin Pro Asp 
65 70 75 80 

Asp Phe Ala Thr Tyr Tyr Cys His Gin Arg Ser Thr Tyr Pro Leu Thr 
85 90 95 

Phe Gly Gin Gly Thr Lys Val Glu Val Lys 



(2) INFORMATION FOR SEQ ID NO: 166: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



100 



105 



100 



105 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 
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Gln 
1 



Val 



Gin 



Leu Val 

5 



Gin Ser Gly Ala Glu Val Lys Lys Pro Gly Ser 

10 15 



Ser Val Lys Val Ser Cys Lys Ala Ser Gly Gly Thr Phe Ser Ara Ser 
20 25 30 

Ala He He Trp Val Arg Gin Ala Pro Gly Gin Gly Leu Glu Trp Met 
35 40 45 

Gly Gly He Val Pro Met Phe Gly Pro Pro Asn Tyr Ala Gin Lys Phe 
50 55 60 

Gin Gly Arg Val Thr He Thr Ala Asp Glu Ser Thr Asn Thr Ala Tvr 
65 70 75 80 

Met Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala Phe Tyr Phe Cvs 
85 90 95 

Ala Gly Gly Tyr Gly He Tyr Ser Pro Glu Glu Tyr Asn Gly Gly Leu 
100 105 no 

Val Thr Val Ser Ser 
115 



(2) INFORMATION FOR SEQ ID NO: 167: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 

Gin Val Gin Leu Gin Gin Ser Gly Ala Glu Leu Ala Lys Pro Gly Ala 
1 5 10 15 

Ser Val Lys Met Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Ser Tyr 
20 25 30 

Arg Met His Trp Val Lys Gin Arg Pro Gly Gin Gly Leu Glu Trp He 
35 40 45 

Gly Tyr He Asn Pro Ser Thr Gly Tyr Thr Glu Tyr Asn Gin Lys Phe 
50 55 60 

Lys Asp Lys Ala Thr Leu Thr Ala Asp Lys Ser Ser Ser Thr Ala Tyr 
65 70 75 80 

Met Gin Leu Ser Ser Leu Thr Phe Glu Asp Ser Ala Val Tyr Tyr Cys 
85 90 95 

Ala Arg Gly Gly Gly Val Phe Asp Tyr Trp Gly Gin Gly Thr Thr Leu 
100 105 110 

Thr Val Ser Ser 



115 



# 
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(2) INFORMATION FOR SEQ ID NO: 168: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION : SEQ ID NO: 168: 

Gin Val Gin Leu Gin Gin Ser Gly Ala Glu Val Ala Lys Pro Gly Ala 
15 10 15 

Ser Val Lys Met Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Ser Tvr 
20 25 30 

Arg Met His Trp Val Lys Gin Ala Pro Gly Gin Gly Leu Glu Trp He 
35 40 45 

Gly Tyr He Asn Pro Ser Thr Gly Tyr Thr Glu Tyr Asn Gin Lys Phe 
50 55 60 

Lys Gly Lys Ala Thr Leu Thr Ala Asp Lys Ser Ser Ser Thr Ala Tvr 
65 70 75 30 

Met Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala Val Tyr Tyr Cys 
85 90 95 

Ala Arg Gly Gly Gly Val Phe Asp Tyr Trp Gly Gin Gly Thr Thr Leu 
100 105 HO 

Thr Val Ser Ser 
115 

(2) INFORMATION FOR SEQ ID NO: 169: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169 : 

Gin Val Gin Leu Val Gin Ser Gly Ala Glu Val Lys Lys Pro Gly Ser 
15 10 15 

Ser Val Lys Val Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Ser Tyr 
20 25 30 

Arg Met His Trp Val Arg Gin Ala Pro Gly Gin Gly Leu Glu Trp He 
35 40 45 

Gly Tyr He Asn Pro Ser Thr Gly Tyr Thr Glu Tyr Asn Gin Lys Phe 
50 55 60 

Lys Asp Lys Ala Thr He Thr Ala Asp Glu Ser Thr Asn Thr Ala Tyr 



® 
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65 70 75 80 

Met Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala Val Tyr Tyr Cys 
85 90 95 

Ala Arg Gly Gly Gly Val Phe Asp Tyr Trp Gly Gin Gly Thr Leu Val 
100 105 110 

Thr Val Ser Ser 
115 

(2) INFORMATION FOR SEQ ID NO: 170: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 
TGACTCGCCC GGCAAGTGAT AGTGACTCTG TCTCCCAGAC ATGCAGACAT GGAAGATGAG 6 0 
GACTGAGTCA TCTGGATGTC 8 0 

(2) INFORMATION FOR SEQ ID NO: 171: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 
TCACTTGCCG GGCGAGTCAG GACATTAATA GCTATTTAAG CTGGTTCCAG CAGAAACCAG 60 
GGAAATCTCC TAAGACCCT 79 
(2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 
GATCCACTGC CACTGAACCT TGATGGGACC CCATCTACCA ATCTGTTTGC ACGATAGATC 60 
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AGGGTCTTAG GAGATTTCC 79 
(2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 
TGTCGACATC ATGGCTTGGG TGTGGACCTT GCTATTCCTG ATGGCAGCTG CCCAAAGTGC 60 



CCAAGCACAG ATCCAGTTGG TGCAG 



85 
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CLAIMS 

1. A fusion protein comprising (a) a gelonin amino acid sequence that has 
enzymatic activity and (b) a targeting sequence that allows the internalization of said fusion 
protein, wherein said targeting sequence is an antibody, an antigen-binding portion of an 
antibody, a hormone, a lymphokine or a growth factor. 

2. The fusion protein of claim 1, wherein said gelonin sequence is that of SEQ 
ID NO. 2. 

3. The fusion protein of claim 1, wherein said gelonin sequence is that of SEQ 
ID NO. 101. 

4. The fusion protein of claim 1, further comprising a linker sequence between 
said gelonin sequence and said targeting sequence, wherein said gelonin possesses enzymatic 
activity, said antibody is capable of recognizing antigen and said hormone, lymphokine or 
growth factor is capable of binding to a cell that has a receptor for said hormone lymphokine 
or growth factor. 

5. The fusion protein of claim 4, wherein said linker sequence is that of SEQ ID 
NO. 56 or SEQ ID NO. 57. 

6. The fusion protein of any one of claim 1-5, wherein said targeting sequence is 
an antibody. 

7. The fusion protein of any one of claims 1-5, wherein said targeting sequence is 
an antigen-binding portion of an antibody. 

8. The fusion protein of claim 7, wherein said antigen-binding portion of said 
antibody is an Fab. 



• # 
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9. The fusion protein of claim 7, wherein, wherein said antigen-binding portion 
of said antibody is an Fab\ 

10. The fusion protein of claim 7, wherein said antigen-binding portion of said 
antibody is an F(ab f ) 2 . 

11. The fusion protein of claim 7, wherein said antigen-binding portion of said 
antibody is an Fv. 

12. The fusion protein of claim 7, wherein said antigen-binding portion of said 
antibody has a single variable domain. 

The fusion protein of claim 7, wherein said antibody is a single-chain 

The fusion protein of claim 7, wherein said fusion protein is multivalent. 
The fusion protein of any one of claims 1-5, wherein said targeting sequence is 

The fusion protein of any one of claims 1-5, wherein said targeting sequence is 
a lymphokine. 

17. The fusion protein of any one of claims 1-5, wherein said targeting sequence is 
a growth factor. 



13. 
antibody. 

14. 

15. 

a hormone. 
16. 
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ABSTRACT 



The present invention provides purified and isolated polynucleotides encoding Type I 
ribosome-inactivating proteins (RIPS) and analogs of the RIPs having a cysteine available for 
disulfide bonding to targeting molecules. Vectors comprising the polynucleotides and host 
cells transformed with the vectors are also provided. The RIPs and RIP analogs are 
particularly suited for use as components of cytotoxic therapeutic agents of the invention 
which include gene fusion products and immunoconjugates. Cytotoxic therapeutic agents or 
immunotoxins according to the present invention may be used to selectively eliminate any 
cell type to which the RIP component is targeted by the specific binding capacity of the 
second component of the agent, and are suited for treatment of diseases where the elimination 
of a particular cell type is a goal, such as autoimmune disease, cancer and graft-versus-host 
disease. 
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Proteins Encoding Gelonin Sequences fas amended): originally: Fusion Protein s and Polynucleotides Encoding Gelonin 
Sequences) , the specification of which is attached hereto unless the following box is checked: 

K was filed on August 18, 1998 ; 

as United States Application Number or PCT International Application Number 09/136,389; and 
was amended on . (if applicable). 

I hereby state that I have reviewed and understand the contents of the above identified specification, including the claims, as 
amended by any amendment referred to above. 

*1 acknowledge the duty to disclose information that is material to patentability as defined in 37 CF.R. § 1.56. 

U hereby claim foreign priority benefits under 35 U.S.C. § 1 19(a)-(d) or § 365(b) of any foreign application(s) for patent or 
-inventor's certificate, or § 365(a) of any PCT international application, which designated at least one country other than the 
"United States listed below, and have also identified below any foreign application for patent or inventor's certificate, or PCT 
^international application having a filing date before that of the application on which priority is claimed. 
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this application is not disclosed in the prior United States or PCT international application in the manner provided by the first 
paragraph of 35 U.S.C. § 1 12, 1 acknowledge the duty to disclose information that is material to patentability as defined in 37 
CF.R. § 1.56 that became available between the filing date of the prior application and the national or PCT international filing 
date of this application. 
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Patent 
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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



In the Application of: 

Better and Carroll 

Serial No. : Not yet assigned 
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Assistant Commissioner for Patents 
Washington, D.C. 20231 

Sir: 

Applicants request entry of the identical computer readable form from a related 
application. Specifically, the computer-readable form of the Sequence Listing in the above- 
identified application is identical to the Sequence Listing in Application Serial No. 09/136,389, 
filed August 18, 1998. In accordance with 37 C.RR. § 1.821(e), please use the computer- 
readable form of the Sequence Listing filed in that application as the computer-readable form in 
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necessary change in application number and filing date for the computer-readable form of the 



Sequence Listing that will be used in the instant application. A paper copy of the sequence 
listing is included in the originally-filed specification of the instant application. 
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